WORLD INTELLECTUAL PROPERTY ORGANIZATION 

Internationa] Bureau 




PCX 

INTERNATIONAL APPUCATION PUBUSHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classificatidn ^ : 

CUN 15700, C07K 14/47, GOIN 33/53 



A2 



(11) International PobUcation Number: WO 00/52151 

(43) Internattonal Publication Date: 8 September 2000 (08.09.00) 



(21) International AppUcation Number: P(nyUS0(V0S621 

(22) International FHing Date: 3 March 2000 (03.03.00) 



(30) Priority Data: 
60/123.117 



5 Nfarch 1999 (OS.03.99) 



US 



(63) Related by Continuation (CON) or Continuation-in-Part 
(CIF) to Earlier Application 

US 60/123,117 (CIP) 

Filed on S March 1999 (05.03.99) 



(71) Applicant (for all designated States except US): INCYTE 

PHARMACEUTICALS, INC, [USAJS]; 3160 Porter Drive, 
Palo Alto, CA 94304 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): TANG, Y., Tom [CN/US]; 
4230 Ranwick Court, San Jose, CA 951 18 (US). LAL, Precti 
(IN/US]; 2382 Lass Drive, Santa Clara, CA 95054 (US). 
BAUGHN. Mariah, R. [US/US]; 14244 Santiago Road, 
San Leandro, CA 94577 (US). YUE, Henry [US/US]; 826 
Lois Avenue, Sunnyvale, CA 94087 (US). AU-YOUNG, 
Janice [US/US]; 233 Golden Eagle Lane, Brisbane, CA 
94005 (US). LU, Dyung, Aina, M. [USAUS]; 55 Parte 



Belmont Place, San Jose, CA 95 136 (US). AZIMZAI, Yalda 
[US/US]; 2045 Rock Springs Drive, Haywaid, CA 94545 
(US). 

(74) Agents: HAMLET-COX, Diana et aU Incyte Pharmaceuticals, 
Inc.. 3160 Porter Drive, Palo Alto, CA 94304 (US). 



(81) Designated States: AE, AL, AM, AT, AU, AZ. BA, BB, BG, 
BR. BY, CA. CH. CN. CU, CZ, DE, DK, EE, ES, FI, GB, 
GD. GE, GH, GM. HR. HU. ID. JL, IN, IS. JP, KE, KG, 
KP, KR, KZ, LC, LK, LR, LS, LT. LU, LV, MD, MG. MK, 
MN, MW, MX, NO, NZ, PL. PT. RO, RU, SD. SE. SG, 
SI, SK, SL, TJ, TM, TR. TF, UA, UG, US, UZ, VN, YU, 
ZA. ZW. ARIPO patent (GH, GM. KE, LS, MW, SD, SL, 
SZ, TZ, UG, ZW), Eurasian patent (AM, AZ, BY, KG, KZ, 
MD, RU, TJ. TM), Eurtjpean patent (AT, BE, CH, CY. DE, 
DK, ES, n, FR, GB, GR, IE, IT, LU. MC, NL, PT, SE), 
OAPI patent (BF, BJ, CF, CG, Q, CM, GA, GN, GW, ML, 
MR. NE, SN. TD, TG). 



Published 

Without international search report and to be reptiblished 
upon receipt of that report. 



(54) Tide: HUMAN SECRETORY PROTEINS 
(57) Abstract 



The invention provides human secretory proteins (HSECP) and polynucleotides which identify and encode HSECP. The invention 
also provides expression vectors, host cells, antibodies, agonists, and antagonists. The invention' also provides methods for diagnosing, 
treating, or preventing disorders associated with expression of HSECP. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Sbvenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


PR 


France 


LU 


Lnxembouig 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


AscilMijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


ttlLlllLlljtjLJl 

isarraaos 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tapkistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yngoslav 


TM 


Ttokmenistan 


BP 


BntkinaFaso 


GR 






Republic of Macedonia 


TR 


Turkey 


BG 


Bolgnria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CP 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Vict Nam 


CG 


Congo 


KE 


Kenya 


NL 


Nedierlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Slimbabwe 


CI 


C6te d'lvoire 


KP 


Demociatic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


PoitagtA 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Re|Rib1ic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


U 




SD 


Sodan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Ubefia 


SG 


Singapore 







PCrAJSOO/05621 
HUMAN SECRETORY PROTEINS 

TECHNICAL FIELD 
This invention relates to nucleic acid and amino acid sequences of human secretory proteins 
5 and to the use of these sequences in the diagnosis, treatment, and prevention of cancer, inflammation, 
and gastrointestinal, cardiovascular, and neurological disorders. 

BACKGROUND OF THE INVENTION 

Protein transport and secretion are essential for cellular function. Protein transport is 

10 mediated by a signal peptide located at the amino terminus of the protein to be transported or 
secreted. The signal peptide is comprised of about ten to twenty hydrophobic amino acids which 
target the nascent protein from the ribosome to a particular membrane bound compartment such as the 
endoplasmic reticulum (ER). Proteins targeted to the ER may either proceed through the secretory 
pathway or remain in any of the secretory organelles such as the ER, Golgi apparatus, or lysosomes. 

15 Proteins that transit through the secretory pathway are either secreted into the extracellular space or 
retained in the plasma membrane. Secreted proteins are often synthesized as inactive precursors that 
are activated by post-translational processing events during transit throu^ the secretory pathway. 
Such events include glycosylation, proteolysis, and removal of the signal peptide by a signal 
peptidase. Other events that may occur during protein transport include chaperone-dependent 

20 unfolding and folding of the nascent protein and interaction of the protein with a receptor or pore 
complex. Examples of secreted proteins with amino terminal signal peptides are discussed below and 
include receptors, extracellular matrix molecules, cytokines, hormones, growth and differentiation 
factors, neuropeptides, vasomediators, ion channels, transporters/pumps, and proteases. (Reviewed in 
Alberts, B. et al. (1994) Molecular Bioloev of The Cell. Garland Publishing, New York, NY, pp. 557- 

25 560, 582-592.) 

G-protein coupled receptors (GPCRs) comprise a superfamily of integral membrane proteins 
which transduce extracellular signals. Not all GPCRs contain N-terminal signal peptides. GPCRs 
include receptors for biogenic amines such as dopamine, epinephrine, histamine, glutamate 
(metabotropic-type), acetylcholine (muscarinic-type), and serotonin; for lipid mediators of 

30 inflammation such as prostaglandins, platelet activating factor, and leukotrienes; for peptide 

hormones such as calcitonin, C5a anaphylatoxin, follicle stimulating hormone, gonadotropin releasing 
hormone, neurokinin, oxytocin, and thrombin; and for sensory signal mediators such as retinal 
photopigments and olfactory stimulatory molecules. The structure of these highly conserved 
receptors consists of seven hydrophobic transmembrane regions, cysteine disulfide bridges between 

35 the second and third extracellular loops, an extracellular N-terminus, and a cytoplasmic C-terminus. 
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The N-terminus interacts with ligands, the disulfide bridges interact with agonists and antagonists, 
and the large third intracellular loop interacts with G proteins to activate second messengers such as 
cyclic AMP, phospholipase C, inositol triphosphate, or ion channels. (Reviewed in Watson, S. and 
Arkinstall, S. (1994) The G-protein Linked Receptor Facts Book . Academic Press. San Diego, CA> 
5 pp. 2-6; and Bolander, RF. (1994) Molecular Endocrinology, Academic Press, San Diego, CA, pp. 
162-176.) 

Other types of receptors include cell surface antigens identified on leukocytic cells of the 
immune system. These antigens have been identified using systematic, monoclonal antibody (mAb)- 
based "shot gun" techniques. These techniques have resulted in the production of hundreds of mAbs 

10 directed against unknown cell surface leukocytic antigens. These antigens have been grouped into 
"clusters of differentiation" based on common immunocytochemical localization patterns in various 
differentiated and undifferentiated leukocytic cell types. Antigens in a given cluster are presumed to 
identify a single cell surface protein and are assigned a "cluster of differentiation" or "CD" 
designation. Some of the genes encoding proteins identified by CD antigens have been cloned and 

15 verified by standard molecular biology techniques . CD antigens have been characterized as both 
transmembrane proteins and cell surface proteins anchored to the plasma membrane via covalent 
attachment to fatty acid-containing glycolipids such as glycosylphosphatidylinositol (GPI). 
(Reviewed in Barclay, A. N. et al. (1995) The Leucocyte Antigen Facts Book . Academic Press, San 
Diego, CA, pp. 17-20.) 

20 Matrix proteins (MPs) are transmembrane and extracellular proteins which function in 

formation, growth, remodeling, and maintenance of tissues and as important mediators and regulators 
of the inflammatory response. The expression and balance of MPs may be perturbed by biochemical 
changes that result from congenital, epigenetic, or infectious diseases. In addition, MPs affect 
leukocyte migration, proliferation, differentiation, and activation in the immune response. MPs are 

25 frequently characterized by the presence of one or more domains which may include collagen-like 
domains, EGF-like domains, immunoglobulin-like domains, and fibronectin-like domains. In 
addition, MPs may be heavily glycosylated and may contain an Arginine-Glycine-Aspartate (RGD) 
tripeptide motif which may play a role in adhesive interactions. MPs include extracellular proteins 
such as flbronectin, collagen, galectin, vitronectin and its proteolytic derivative somatomedin B; and 

30 cell adhesion receptors such as cell adhesion molecules (CAMs), cadherins, and integrins. (Reviewed 
in Ayad, S. et al. (1994) The Extracellular Matrix Facts Book . Academic Press, San Diego, CA, pp. 2- 
16; Ruoslahti, E. (1997) Kidney Int. 51:1413-1417; Sjaastad, M.D. and Nelson, W.J. (1997) 
BioEssays 19:47-55.) 

Cytokines are secreted by hematopoietic cells in response to injury or infection. Interleukins, 
35 neurotrophins, growth factors, interferons, and chemokines all define cytokine families that work in 
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conjunction with cellular receptors to regulate cell proliferation and differentiation. In addition, 
cytokines effect activities such as leukocyte migration and function, hematopoietic cell proliferation, 
temperature regulation, acute response to infection, tissue remodeling, and apoptosis. 

Chemokines, in particular, are small chemoattractant cytokines involved in inflanunation, 
5 leukocyte proliferation and migration, angiogenesis and angiostasis, regulation of hematopoiesis, HIV 
infectivity, and stimulation of cytokine secretion. Chemokines generally contain 70-100 amino acids 
and are subdivided into four subfamilies based on the presence of conserved cysteine-based motifs. 
(Callard. R. and Gearing, A. (1994) The Cytokine Facts Book. Academic Press, New York, NY, pp. 
181.190,210-213,223-227.) 

10 Growth and differentiation factors are secreted proteins which function in intercellular 

communication. Some factors require oligomerization or association with MPs for activity. Complex 
interactions among these factors and their receptors trigger intracellular signal transduction pathways 
that stimulate or inhibit cell division, cell differentiation, cell signaling, and cell motility. Most 
growth and differentiation factors act on cells in their local environment (paracrine signaling). There 

15 are three broad classes of growth and differentiation factors. The first class includes the large 
polypeptide growth factors such as epidermal growth factor, fibroblast growth factor, transforming 
growth factor, insulin-like growth factor, and platelet-derived growth factor. The second class 
includes the hematopoietic growth factors such as the colony stimulating factors (CSFs). 
Hematopoietic growth factors stimulate the proliferation and differentiation of blood cells such as B- 

20 lymphocytes, T-lymphocytes, erythrocytes, platelets, eosinophils, basophils, neutrophils, 

macrophages, and their stem cell precursors. The third class includes small peptide factors such as 
bombesin, vasopressin, oxytocin, endothelin, transferrin, angiotensin II, vasoactive intestinal peptide, 
and bradykinin which function as hormones to regulate cellular functions other than proliferation. 

Growth and differentiation factors play critical roles in neoplastic transformation of cells in 

25 vitro and in tumor progression in vivo . Inappropriate expression of growth factors by tumor cells may 
contribute to vascularization and metastasis of tumors. During hematopoiesis, growth factor 
misregulation can result in anemias, leukemias, and lymphomas. Certain growth factors such as 
interferon are cytotoxic to tumor cells both in vivo and in vitro . Moreover, some growth factors and 
growth factor receptors are related both structurally and functionally to oncoproteins. In addition, 

30 growth factors affect transcriptional regulation of both proto-oncogenes and oncosuppressor genes. 
(Reviewed in Pimentel, E. (1994) Handbook of Growth Factors . CRC Press, Ann Arbor, MI, pp. 1-9.) 

Proteolytic enzymes or proteases either activate or deactivate proteins by hydrolyzing peptide 
bonds. Proteases are found in the cytosol, in membrane-bound compartments, and in the extracellular 
space. The major families are the zinc, serine, cysteine, thiol, and carboxyl proteases. 

35 Ion channels, ion pumps, and transport proteins mediate the transport of molecules across 
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cellular membranes. Transport can occur by a passive, concentration-dependent mechanism or can be 
linked to an energy source such as ATP hydrolysis. Symporters and antiporters transport ions and 
small molecules such as amino acids, glucose, and drugs. Symporters transport molecules and ions 
unidirectionally, and antiporters transport molecules and ions bidirectionally. Transporter 
5 superfamilies include facilitative transporters and active ATP-binding cassette transporters which are 
involved in multiple-drug resistance and the targeting of antigenic peptides to MHC Class I 
molecules. These transporters bind to a specific ion or other molecule and undergo a conformational 
change in order to transfer the ion or molecule across the membrane. (Reviewed in Alberts, B. et al. 
(1994) Molecular Bioloev of The CelL Garland Publishing. New York, NY, pp. 523-546.) 

10 Ion channels are formed by transmembrane proteins which create a lined passageway across 

the membrane through which water and ions, such as Na^ K\ Ca^^ and Cr, enter and exit the cell. 
For example, chloride channels are involved in the regulation of the membrane electric potential as 
well as absorption and secretion of ions across the membrane. Chloride channels also regulate the 
internal pH of membrane-bound organelles. 

15 Ion pumps are ATPases which actively maintain membrane gradients. Ion pumps are 

classified as P, V, or F according to their structure and function. All have one or more binding sites 
for ATP in their cytosolic domains. The P-class ion pumps include Ca^* ATPase and Na*/K* ATPase 
and function in transporting H*, Na*, K*, and Ca^* ions. P-class pumps consist of two a and two p 
transmembrane subunits. The V- and F-class ion pumps have similar structures but transport only H^ 

20 F class H*^ pumps mediate transport across the membranes of mitochondria and chloroplasts, while V- 
class H**^ pumps regulate acidity inside lysosomes, endosomes, and plant vacuoles. 

A family of structurally related intrinsic membrane proteins known as facilitative glucose 
transporters catalyze the movement of glucose and other selected sugars across the plasma membrane. 
The proteins in this family contain a highly conserved, large transmembrane domain comprised of 12 

25 a-helices, and several weakly conserved, cytoplasmic and exoplasmic domains. (Pessin, J. E., and 
Bell, G.L (1992) Annu. Rev. Physiol. 54:911-930.) 

Amino acid transport is mediated by Na*^ dependent amino acid transporters. These 
transporters are involved in gastrointestinal and renal uptake of dietary and cellular amino acids and 
in neuronal reuptake of neurotransmitters. Transport of cationic amino acids is mediated by the 

30 system y+ family and the cationic amino acid transporter (CAT) family. Members of the CAT family 
share a high degree of sequence homology, and each contains 12-14 putative transmembrane 
domains. (Ito, K. and Groudine, M. (1997) J. Biol. Chem. 272:26780-26786.) 

Hormones are secreted molecules that travel through the circulation and bind to specific 
receptors on the surface of, or within, target cells. Although they have diverse biochemical 

35 compositions and mechanisms of action, hormones can be grouped into two categories. One category 
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includes small lipophilic hormones that diffuse through the plasma membrane of target cells, bind to 
cytosolic or nuclear receptors, and form a complex that alters gene expression. Examples of these 
molecules include retinoic acid, thyroxine, and the cholesterol-derived steroid hormones such as 
progesterone, estrogen, testosterone, Cortisol, and aldosterone. The second category includes 

5 hydrophilic hormones that function by binding to cell surface receptors that transduce signals across 
the plasma membrane. Examples of such hormones include amino acid derivatives such as 
catecholamines and peptide hormones such as glucagon, insulin, gastrin, secretin, cholecystokinin, 
adrenocorticotropic hormone, follicle stimulating hormone, luteinizing hormone, thyroid stimulating 
hormone, and vasopressin. (See, for example, Lodish et al. (1995) Molecular Cell Biologv . Scientific 

10 American Books Inc., New York, NY, pp. 856-864.) 

Neuropeptides and vasomediators (NP/VM) comprise a large family of endogenous signaling 
molecules. Included in this family are neuropeptides and neuropeptide hormones such as bombesin, 
neuropeptide Y, neurotensin, neuromedin N, melanocortins, opioids, galanin, somatostatin, 
tachykinins, urotensin II and related peptides involved in smooth muscle stimulation, vasopressin, 

15 vasoactive intestinal peptide, and circulatory system-bome signaling molecules such as angiotensin, 
complement, calcitonin, endothelins, formyl-n^thionyl peptides, glucagon, cholecystokinin and 
gastrin. NP/VMs can transduce signals directly, modulate the activity or release of other 
neurotransmitters and hormones, and act as catalytic enzymes in cascades. The effects of NPA^Ms 
range from extremely brief to long-lasting. (Reviewed in Martin, C. R. et al. (1985) Endocrine 

20 Phvsiolopv , Oxford University Press, New York, NY, pp. 57-62.) 

The discovery of new human secretory proteins and the polynucleotides encoding them 
satisfles a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of cancer, inflammation, and gastrointestinal, cardiovascular, and 
neurological disorders. 

25 

SUMMARY OF THE INVENTION 
The invention features purified polypeptides, human secretory proteins, referred to 
collectively as "HSECF* and individually as "HSECP-1," "HSECP-2," "HSECP-3," "HSECP-4," 
"HSECP.5," "HSECP-6," "HSECP-7," "HSECP-8," "HSECP-9," "HSECP-10,^' "HSECP-1 1," 

30 "HSECP-12," "HSECP-13." "HSECP-14," "HSECP-15," "HSECP-16," "HSECP-17,*' "HSECP-18," 
"HSECP-19," "HSECP-20," "HSECP-21" and "HSECP-22." In one aspect, the invention provides an 
isolated polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-22, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, c) a biologically active 

35 fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, or d) an 
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immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-22. In one alternative, the invention provides an isolated polypeptide comprising the amino 
acid sequence of SEQ ID NO: 1-22. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-22, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: I -22, or d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22. In one 

10 alternative, the polynucleotide is selected from the group consisting of SEQ ID NO:23^. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising a) an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1 -22, b) a naturally occurring amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 

15 consisting of SEQ ID NO: 1-22, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-22, or d) an inununogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-22. In one alternative, the invention 
provides a cell transformed with the recombinant polynucleotide. In another alternative, the invention 
provides a transgenic organism comprising the recombinant polynucleotide. 

20 The invention also provides a method for producing a polypeptide comprising a) an amino 

acid sequence selected from the group consisting of SEQ ED NO: 1 -22, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-22, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-22, or d) an immunogenic fragmmt of an amino 

25 acid sequence selected from the group consisting of SEQ ID NO: 1-22. The method comprises a) 
culturing a cell under conditions suitable for expression of the polypeptide, wherein said cell is 
transformed with a recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide encoding the polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 

30 polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-22, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, c) a biologically active 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, or d) an 
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID 

35 NO:l-22. 
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The invention further provides an isolated polynucleotide comprising a) a polynucleotide 
sequence selected from the group consisting of SEQ ID NO:23-44, b) a natxirally occurring 
polynucleotide sequence having at least 70% sequence identity to a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:23-44, c) a polynucleotide sequence complementary to a), 

.5 or d) a polynucleotide sequence complementary to b). In one alternative, the polynucleotide 
comprises at least 60 contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a 
sample, said target polynucleotide having a sequence of a polynucleotide conq)rising a) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, b) a naturally 

10 occurring polynucleotide sequence having at least 70% sequence identity to a polynucleotide 
sequence selected from the group consisting of SEQ ID NO:23-44, c) a polynucleotide sequence 
complementary to a), or d) a polynucleotide sequence complementary to b). The method comprises a) 
hybridizing the sample with a probe comprising at least 16 contiguous nucleotides comprising a 
sequence complementary to said target polynucleotide in the sample, and which probe specifically 

15 hybridizes to said target polynucleotide, under conditions whereby a hybridization complex is formed 
between said probe and said target polynucleotide, and b) detecting the presence or absence of said 
hybridization complex, and optionally, if present, the amount thereof. In one alternative, the probe 
comprises at least 30 contiguous nucleotides. In another alternative, the probe comprises at least 60 
contiguous nucleotides. 

20 The invention further provides a pharmaceutical composition comprising an effective amount 

of a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-22, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, c) a biologically active 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, or d) an 

25 immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID 

NO: 1-22, and a pharmaceutically acceptable excipient. The invention additionally provides a method 
of treating a disease or condition associated with decreased expression of functional HSECP, 
comprising administering to a patient in need of such treatment the pharmaceutical composition. 
The invention also provides a method for screening a compound for effectiveness as an 

30 agonist of a polypeptide comprising a) an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-22, b) a naturally occurring amino acid sequence having at least 90% sequence identity 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, c) a biologically 
active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, or 
d) an inmiunogenic fragment of an amino acid sequence selected from the group consisting of SEQ 

35 ID NO: 1-22. The method comprises a) exposing a sample comprising the polypeptide to a 
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compound, and b) detecting agonist activity in the sample. In one alternative, the invention provides 
a pharmaceutical composition comprising an agonist compound identiHed by the method and a 
pharmaceutically acceptable excipient. In another alternative, the invention provides a method of 
treating a disease or condition associated with decreased expression of functional HSECP, 
S comprising administering to a patient in need of such treatment the pharmaceutical composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 
an antagonist of a polypeptide comprising a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-22, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, 

10 c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-22, or d) an inmiunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-22. The method comprises a) exposing a sample comprising the 
polypeptide to a compound, and b) detecting antagonist activity in the sample. In one alternative, the 
invention provides a pharmaceutical composition comprising an antagonist compound identified by 

15 the method and a pharmaceutically acceptable excipient. In another alternative, the invention 
provides a method of treating a disease or condition associated with overexpression of functional 
HSECP, comprising administering to a patient in need of such treatment the pharmaceutical 
composition. 

The invention further provides a method for screening a compound for effectiveness in 
20 altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO:23-44, the method comprising a) 
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 

25 BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
length sequences encoding HSECP. 

Table 2 shows features of each polypeptide sequence, including predicted signal peptides and 
30 other motifs, and methods, algorithms, and searchable databases used for analysis of HSECP. 

Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific 
expression patterns of each nucleic acid sequence as determined by northern analysis; diseases, 
disorders, or conditions associated with these tissues; and the vector into which each cDNA was 
cloned. 

35 Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
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encoding HSECP were isolated. 

Table S shows the tools, programs, and algorithms used to analyze HSECP, along with 
applicable descriptions, references, and threshold parameters. 

5 DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which 

10 will be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms *'a," '*an," 
and "the" include plural reference unless the context cleariy dictates otherwiise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 

15 forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
20 described. All publications mentioned herein are cited for the purpose of describing and disclosing 
the cell lines, protocols, reagents and vectors which are reported in the publications and which might 
be used in connection with the invention. Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

25 "HSECP" refers to the amino acid sequences of substantially purified HSECP obtained from 

any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
HSECP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 

30 compound or composition which modulates the activity of HSECP either by directly interacting with 
HSECP or by acting on components of the biological pathway in which HSECP participates. 

An "allelic variant" is an alternative form of the gene encoding HSECP. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 

35 many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
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allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

"Altered" nucleic acid sequences encoding HSECP include those sequences with deletions, 
5 insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as HSECP or a 
polypeptide with at least one functional characteristic of HSECP. Included within this defmition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe 
of the polynucleotide encoding HSECP, and improper or unexpected hybridization to allelic variants, 
with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding 

10 HSECP. The encoded protein may also be "altered," and may contain deletions, insertions, or 
substitutions of amino acid residues which produce a silent change and result in a functionally 
equivalent HSECP. Deliberate amino acid substitutions may be made on the.basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 
residues, as long as the biological or immunological activity of HSECP is retained. For example, 

15 negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 
amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having 
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine. 
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, 
' isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 

20 The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 

polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to an amino acid sequence of a namrally 
occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino 
acid sequence to the complete native amino acid sequence associated with the recited protein 

25 molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
30 of HSECP. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of HSECP either by 
directly interacting with HSECP or by acting on components of the biological pathway in which 
HSECP participates. 

The term "antibody" refers to intact inmiunoglobulin molecules as well as to fragments 
35 thereof, such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant. 



10 



wo 00/52151 



PCTAJSOO/05621 



Antibodies that bind HSECP polypeptides can be prepared using intact polypeptides or using 
fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 
oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the 
translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. 
5 Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, 

thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 
the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 

10 immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the inrunune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 

15 strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; peptide 
nucleic acid (PNA); oligonucleotides having modified backbone linkages such as phosphorothioates, 
methylphosphonates, or benzylphosphonates; oligonucleotides having modified sugar groups such as 
2 -methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having modified bases such 
as 5-methyl cytosine, 2*-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense molecules may be 

20 produced by any method including chemical synthesis or transcription. Once introduced into a cell, 
the complementary antisense molecule base-pairs with a naturally occurring nucleic acid sequence 
produced by the cell to fomri duplexes which block either transcription or translation. The 
designation "negative" or "minus" can refer to the antisense strand, and the designation "positive" or 
"plus" can refer to the sense strand of a reference DNA molecule. 

25 The term "biologically active" refers to a protein having structural, regulatory, or biochemical 

functions of a naturally occurring molecule. Likewise, "immunologically active" refers to the 
capability of the natural, recombinant, or synthetic HSECP, or of any oligopeptide thereof, to induce 
a specific immiine response in appropriate animals or cells and to bind with specific antibodies. 
The terms "complementary" and "complementarity" refer to the natural binding of 

30 polynucleotides by base pairing. For example, the sequence "5* A-G-T 3'" bonds to the 

complementary sequence "3* T-C-A 5*." Complementarity between two single-stranded molecules 
may be "partial." such that only some of the nucleic acids bind, or it may be "complete," such that 
total complementarity exists between the single stranded molecules. The degree of complementarity 
between nucleic acid strands has significant effects on the efficiency and strength of the hybridization 

35 between the nucleic acid strands. This is of particular importance in amplification reactions, which 
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depend upon binding between nucleic acid strands, and in the design and use of peptide nucleic acid 
(PNA) molecules. 

A "composition comprising a given polynucleotide sequence" and a "composition comprising 
a given amino acid sequence" refer broadly to any composition containing the given polynucleotide 
5 or amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding HSECP or fragments of HSECP may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl 
10 sulfate; SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 
"Consensus sequence'* refers to a nucleic acid sequence which has been resequenced to 
resolve uncalled bases, extended using the XL-PCR kit (Perkin-Elmer, Norwalk CT) in the 5' and/or 
the 3' direction, and resequenced, or which has been assembled from the overlapping sequences of 
one or more Incyte Clones and, in some cases, one or more public domain ESTs, using a computer 
15 program for fragment assembly, such as the GELVBEW fragment assembly system (GCG, Madison 
WI). Some sequences have been both extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substinitions that, when made, least 
interfere with the properties of the original protein, i.e., the structure and especially the function of 
the protein is conserved and not significantly changed by such substitutions. The table below shows 
20 amino acids which may be substituted for an original amino acid in a protein and which are regarded 
as conservative amino acid substitutions. 





Original Residue 


Conservative Substitution 




Ala 


Gly, Ser 




Arg 


His, Lys 


25 


Asn 


Asp, Gin, His 




Asp 


Asn, Glu 




Cys 


Ala, Ser 




Gin 


Asn, Glu, His 




Glu 


Asp, Gin, His 


30 


Gly 


Ala 




His 


Asn, Arg, Gin, Glu 




De 


Leu, Val 




Leu 


He, Val 




Lys 


Arg, Gin. Glu 


35 


Met 


Leu, He 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 




Thr 


Ser. Val 




Trp 


Phe, Tyr 


40 


Tyr 


His, Phe, Trp 




Val 


He, Leu, Thr 
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Conservative amino acid substitutions generally niaintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 

S A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 

absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to the chemical modification of a polypeptide sequence, or a 
polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for 
example, replacement of hydrogen by an alkyl, acyl, hydroxyl, or amino group. A derivative 

10 polynucleotide encodes a polypeptide which retains at least one biological or immunological function 
of the natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or 
any similar process that retains at least one biological or immunological function of the polypeptide 
from which it was derived. 

A "fragment" is a unique portion of HSECP or the polynucleotide encoding HSECP which is 

IS identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, 
a fragment may comprise from S to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 
15, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 

20 residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
defmed sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by die present 

25 embodiments. 

A fragment of SEQ ID NO:23-44 comprises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO:23-44, for example, as distinct from any other sequence in the 
same genome. A fragment of SEQ ID NO:23-44 is useful, for example, in hybridization and 
amplification technologies and in analogous methods that distinguish SEQ ID NO:23-44 from related 

30 polynucleotide sequences. The precise lengUi of a fragment of SEQ ID NO:23-44 and the region of 
SEQ ID NO:23'44 to which the fragment corresponds are routinely determinable by one of ordinary 
skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO: 1-22 is encoded by a fragment of SEQ ID NO:23-44. A fragment 
of SEQ ID NO: 1-22 comprises a region of unique amino acid sequence that specifically identifies 

35 SEQ ID NO: 1-22. For example, a fragment of SEQ ID NO: 1-22 is useful as an immunogenic peptide 
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for the development of antibodies that specifically recognize SEQ ID NO: 1-22. TTie precise length of 
a fragment of SEQ ID NO:l-22 and the region of SEQ ID NO:l-22 to which the fragment 
corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
purpose for the fragment. 

5 The term "similarity" refers to a degree of complementarity. There may be partial similarity 

or complete similarity. The word "identity" may substitute for the word "similarity." A partially 
complementary sequence that at least partially inhibits an identical sequence from hybridizing to a 
target nucleic acid is referred to as "substantially similar " The inhibition of hybridization of the 
completely complementary sequence to the target sequence may be examined using a hybridization 

1 0 assay (Southern or northern blot, solution hybridization, and the like) under conditions of reduced 
stringency. A substantially similar sequence or hybridization probe will compete for and inhibit the 
binding of a completely similar (identical) sequence to the target sequence under conditions of 
reduced stringency. This is not to say that conditions of reduced stringency are such that non-specific 
binding is permitted, as reduced stringency conditions require that the binding of two sequences to 

15 one another be a specific (i.e., a selective) interaction. The absence of non-specific binding may be 
tested by the use of a second target sequence which lacks even a partial degree of complementarity 
(e.g., less than about 30% similarity or identity). In the absence of non-specific binding, the 
substantially similar sequence or probe will not hybridize to the second non-complementary target 
sequence. 

20 The phrases "percent identity" and "% identity," as applied to polynucleotide sequences, 

refer to the percentage of residue matches between at least two polynucleotide sequences aligned 
using a standardized algorithm. Such an algorithm may insert, in a standardized and reproducible 
way, gaps in the sequences being compared in order to optimize alignment between two sequences, 
and therefore achieve a more meaningful comparison of the two sequences. 

25 Percent identity between polynucleotide sequences may be determined using the default 

parameters of the CLUSTAL V algorithm as incorporated into the MEGAUGN version 3. 12e 
sequence alignment program. This program is part of the LASERGENE software package, a suite of 
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 
Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 

30 8:1 89-1 9 1 . For pairwise alignments of polynucleotide sequences, the default parameters are set as 
follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue 
weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent 
similarity" between aligned polynucleotide sequence pairs. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 

35 is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment 
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Search Tool (BLAST) (Altschul, S.R et al. (1990) J. MoL Biol. 215:403-410). which is available 
from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence 
analysis programs including "blastn/* that is used to align a known polynucleotide sequence with 
other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. 
The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.9 (May-07-1999) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Reward for match: I 

Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 

Gap X drop-off: 50 

Expect: 10 

Word Size: U 

Filter: on 

Percent identity may be njeasured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 
substitutions, explained in more detail above, generally preserve the hydrophobicity and acidity at the 



15 



wo 00/52151 



PCTAJSOO/05621 



site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). For pairwise alignments of 
5 polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=: 1 » gap 
penalty=3, window=5, and ''diagonals saved"=:5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
(TLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
10 comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.9 
(May-07-1999) with blastp set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
Cap X drop-off: 50 
15 Expect: 10 

Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 

20 example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 

instance, a fragment of at least IS, at least 20, at least 30, at least 40, at least SO, at least 70 or at least 
ISO contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

25 "Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 

DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 
stable mitotic chromosome segregation and maintenance. 

The term "humanized antibody" refers to antibody molecules in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 

30 resembles a human antibody, and still retains its original binding ability. 

•Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of identity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 

35 after the "washing" step(s). The washing step(s) is particularly important in determining the 
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stringency of the hybridization process, with more stringent conditions allowing less non-specific 
binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 
conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill 
in the art and may be consistent among hybridization experiments, whereas wash conditions may be 
5 varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68*^0 in the presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 pg/ml denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out. Generally, such wash temperatures are selected to be about 

10 5^C to 2(fC lower than the thermal melting point (T^ for the specific sequence at a defined ionic 
strength and pH. The T„ is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating T„ and 
conditions for nucleic acid hybridization are well known and can be found in Sambrook et al., 1989, 
Molecular Cloning: A Laboratory ManuaL 2^ ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; 

15 specifically see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present 
invention include wash conditions of dS'^C in the presence of about 0.2 x SSC and about 0.1% SDS, 
for 1 hour. Alternatively, temperatures of about 65°C, 60°C, 55**C, or 42°C may be used. SSC 
concentration may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1 %. 

20 Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents 
include, for instance, denatured salmon sperm DNA at about 100-200 fig/ml. Organic solvent, such 
as formamide at a concentration of about 35-50% v/v, may also be used under particular 
circumstances, such as for RNA:DNA hybridizations. Useful variations on these wash conditions 
will be readily apparent to those of ordinary skill in the art. Hybridization, particularly under high 

25 stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such 
similarity is strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex'* refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g.. Cot or R^t analysis) or formed between one 

30 nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 
to which cells or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

35 "Immime response" can refer to conditions associated with inflammation, trauma, immune 
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disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "imniunogenic fragment" is a polypeptide or oligopeptide fragment of HSECP which is 
5 capable of eliciting an immune response when introduced into a living organism, for example, a 
manunal. The term ''immunogenic fragment** also includes any polypeptide or oligopeptide fragment 
of HSECP which is useful in any of the antibody production methods disclosed herein or known in 
the art. 

The term ^'microarray" refers to an arrangement of distinct polynucleotides on a substrate. 
10 The terms ""element" and "*array element" in a microarray context, refer to hybridizable 

polynucleotides arranged on the surface of a substrate. 

The term "modulate" refers to a change in the activity of HSECP. For example, modulation 

may cause an increase or a decrease in protein activity, binding characteristics, or any other 

biological, functional, or immunological properties of HSECP. 
15 The phrases "nucleic acid" or "nucleic acid sequence," as used herein, refer to a nucleotide, 

oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA 

of genomic or synthetic origin which may be single^stranded or double-stranded and may represent 

the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like 

material. 

20 "Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 

functional relationship with the second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Generally, operably linked DNA sequences may be in close proximity or contiguous and, 
where necessary to join two protein coding regions, in the same reading frame. 

25 "Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 

comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

30 "Probe" refers to nucleic acid sequences encoding HSECP, their complements, or fragments 

thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 
"Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target 

35 polynucleotide by complementary base-pairing. The primer may then be extended along the target 
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DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least IS contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 

5 be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
pr at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 

10 example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual . 2"^ ed., vol. 1-3, Cold 
Spring Harbor Press, Plain view NY; Ausubel et al.,1987. Current Protocols in Molecular Biologv . 
Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis et al., 1990, PCR Protocols. A 
Guide to Methods and Applications ^ Academic Press, San Diego CA. PCR primer pairs can be 
derived from a known sequence, for example, by using computer programs intended for that purpose 

15 such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 

20 selection programs have incorporated additional features for expanded capabilities. For example, the 
PrimOU primer selection program (available to the public from the Genome Center at University of 
Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from 
megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 
primer selection program (available to the public from the Whitehead Institute/MIT Center for 

25 Genon^ Research, Cambridge MA) allows the user to input a "mispriming library," in which 

sequences to avoid as primer binding sites are user-specified. PrimerS is useful, in particular, for the 
selection of oligonucleotides for microarrays. (The source code for the latter two primer selection 
programs may also be obtained from their respective sources and modified to meet the user's specific 
needs.) The PrimeGen program (available to the public from the UK Human Genome Mapping 

30 Project Resource Centre, Cambridge UK) designs primers based on multiple sequence alignments, 
thereby allowing selection of primers that hybridize to either the most conserved or least conserved 
regions of aligned nucleic acid sequences. Hence, this program is useful for identification of both 
unique and conserved oligonucleotides and polynucleotide fragments. The oligonucleotides and 
polynucleotide fragments identified by any of the above selection methods are useful in hybridization 

35 technologies, for example, as PCR or sequencing primers, microarray elements, or specific probes to 
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identify fully or partially complementary polynucleotides in a sample of nucleic acids. Methods of 
oligonucleotide selection are not limited to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by ah artificial combination oif two or more otherwise separated segments of sequence. 

5 This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 

10 sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective inununological response in the mammal. 

15 An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 

sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 

20 acids encoding HSECP, or fragments thereof, or HSECP itself, may comprise a bodily fluid; an 

extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, 
RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 

25 synthetic binding composition. The interaction is dependent upon the presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
containing the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 
and the antibody will reduce the amount of labeled A that binds to the antibody. 

30 The term "substantially purified" refers to nucleic acid or amino acid sequences that are 

removed from their natural environment and are isolated or separated, and are at least 60% fiee, 
preferably at least 75% free, and most preferably at least 90% free from other components with which 
they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acids or nucleotides by 

35 different amino acids or nucleotides, respectively. 
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^'Substrate'* refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 
5 'Transformation" describes a process by which exogenous DNA enters and changes a 

recipient cell. Transformation may occur under natural or artificial conditions according to various 
methods well known in the art, and may rely on any known method for the insertion of foreign 
nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for transformation is 
selected based on the type of host cell being transformed and may include, but is not limited to, viral 

10 infection, electroporation, heat shock, lipofection, and particle bombardment. The term 

''transformed" cells includes stably transformed cells in which the inserted DNA is capable of 
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well 
as transiently transformed cells which express the inserted DNA or RNA for limited periods of time. 
A "transgenic organism," as used herein, is any organism, including but not limited to 

15 animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into. a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 

20 vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, and plants and animals. The isolated DNA of the present invention can be 
introduced into the host by methods known in the art, for example infection, transfection, 
transformation or transconjugation. Techniques for transferring the DNA of the present invention 

25 into such organisms are widely known and provided in references such as Sambrook et al. (1989), 
supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 

30 1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or 
greater sequence identity over a certain defined length. A variant may be described as, for example, 
an "allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may 
have significant identity to a reference molecule, but will generally have a greater or lesser number of 

35 polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding 
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polypeptide may possess additional functional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides generally will have significant amino acid identity relative to 
each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 
5 between individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms*' (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 
propensity for a disease state. 

A ^Variant" of a particular polypeptide sequence is defmed as a polypeptide sequence having 

10 at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 

15 THE INVENTION 

The invention is based on the discovery of new human human secretory proteins (HSECP), 
the polynucleotides encoding HSECP, and the use of these compositions for the diagnosis, treatment, 
or prevention of cancer, inflammation, and gastrointestinal,.cardiovascular, and neurological 
disorders. 

20 Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 

HSECP. Colunms 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the 
polypeptide and nucleotide sequences, respectively. Colunui 3 shows the clone IDs of the Incyte 
clones in which nucleic acids encoding each HSECP were identified, and column 4 shows the cDNA 
libraries from which these clones were isolated. Column 5 shows Incyte clones and their 

25 corresponding cDNA libraries. Clones for which cDNA libraries are not indicated were derived from 
pooled cDNA libraries. The Incyte clones in column 5 were used to assemble the consensus 
nucleotide sequence of each HSECP and are useful as fragments in hybridization technologies. 

The columns of Table 2 show various properties of each of the polypeptides of the invention: 
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 

30 polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; and 
column 6 shows analytical methods and in some cases, searchable databases to which the analytical 
methods were applied. The methods of^column 6 were used to characterize each polypeptide through 
sequence homology and protein motifs. In column 5, the first line of each cell lists the amino acid 

35 residues comprising predicted signal peptide sequences located at the amino terminus of each 
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HSECP. Additional identifying motifs or signatures, such as a somatomedin B signature in SEQ ID 
NO: 16 and seven putative transmembrane domains in SEQ ID NO: 18, are also listed in column S. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding HSECP. The first column of Table 3 lists the 
5 nucleotide SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1. These 
fragments are useful, for example, in hybridization or amplification technologies to identify SEQ ID 
NO:23-44 and to distinguish between SEQ ID NO:23-44 and related polynucleotide sequences. The 
polypeptides encoded by these fragments are useful, for example, as immunogenic peptides. Column 
3 lists tissue categories which express HSECP as a fraction of total tissues expressing HSECP. 

10 Column 4 lists diseases, disorders, or conditions associated with those tissues expressing HSECP as a 
fraction of total tissues expressing HSECP. Column 5 lists the vectors used to subclone each cDNA 
library. In particular, three out of four cDNA libraries which express SEQ ID NO:23 are derived 
from cartilage and synovia associated with joint inflammation, and four out of five cDNA libraries 
which express SEQ ID NO:29 are derived from intestinal tissue. Furthermore, about half of the 

15 cDNA libraries expressing SEQ ID NO:34 are associated with inflammation or the 

hematopoietic/immune system. Likewise, about half of the cDNA libraries expressing SEQ ID 
N0:3S are associated with inflammation or the hematopoietic/immune system, and in particular, with 
inflammation of the joints. In addition, 82% of the cDNA libraries expressing SEQ ID NO:37 are 
derived from tissues of the nervous system. Finally, expression of SEQ ID NO:39 is detected solely 

20 in a subtracted prostate tumor cDNA library, and expression of SEQ ID NO:43 is detected only in two 
cDNA libraries derived from heart tissue. 

The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 
from which cDNA clones encoding HSECP were isolated. Column 1 references the nucleotide SEQ 
ID NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 

25 shows the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 

The invention also encompasses HSECP variants. A preferred HSECP variant is one which 
has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the HSECP amino acid sequence, and which contains at least one functional or 
structural characteristic of HSECP. 

30 The invention also encompasses polynucleotides which encode HSECP. In a particular 

embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:23-44, which encodes HSECP. The polynucleotide 
sequences of SEQ ID NO:23-44, as presented in the Sequence Listing, embrace the equivalent RNA 
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 

35 sugar backbone is composed of ribose instead of deoxyribose. 
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The invention also enconq>asses a variant of a polynucleotide sequence encoding HSECP. In 
particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at 
least about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide 
sequence encoding HSECP. A particular aspect of the invention encompasses a variant of a 

5 polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID 
NO:23-44 which has at least about 70%, or alternatively at least about 85%, or even at least about 
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting 
of SEQ ID NO:23-44. Any one of the polynucleotide variants described above can encode an amino 
acid sequence which contains at least one functional or structural characteristic of HSECP. 

10 It will be appreciated by those skilled in the art that as a result of the degeneracy of the 

genetic code, a multitude of polynucleotide sequences encoding HSECP, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 

15 combinations are made in accordance with the standard triplet genetic code as applied to the 

polynucleotide sequence of naturally occurring HSECP, and ali such variations are to be considered 
as being specifically disclosed. 

Although nucleotide sequences which encode HSECP and its variants are generally capable 
of hybridizing to the nucleotide sequence of the naturally occurring HSECP under appropriately 

20 selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding 
HSECP or its derivatives possessing a substantially different codon usage, e.g., inclusion of non- 
naturally occurring codons. Codons may be selected to increase the rate at which expression of the 
peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with 
which particular codons are utilized by the host. Other reasons for substantially altering the 

25 nucleotide sequence encoding HSECP and its derivatives without altering the encoded amino acid 
sequences include the production of RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode HSECP and 
HSECP derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 

30 synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding HSECP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and. in particular, to those shown in SEQ ID 

35 NO;23-44 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
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G33S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel. A.R. (1987) Methods Enzymol. 
152:507-51 1.) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
5 the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Perkin- 
Elmer), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases ahd proofreading exonucleases such as those found in the ELONGASE 
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is 

10 automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV). 
PTC200 thermal cycler (MJ Research. Watertown MA) and ABI CATALYST 800 thermal cycler 
(Perkin-Elmer). Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing 
system (Perkin-Elmer), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics, 
Sunnyvale CA), or other systems known in the art. The resulting sequences are analyzed using a 

15 variety of algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short 

Protocols in Molecular Biology . John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) 
Molecular Biologv and Biotechnologv. Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding HSECP may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 

20 such as promoters and regulatory elements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising 

25 a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments 
adjacent to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, 
M.etal. (1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme 
digestions and ligations may be used to insert an engineered double-stranded sequence into a region 

30 of unknown sequence before performing PCR. Other methods which may be used to retrieve 
unknown sequences are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 
19:3055-3060). Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries 
(Clontech, Palo Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries 
and is useful in finding intron/exon junctions. For all PCR-based methods, primers npay be designed 

35 using conunercially available software, such as OLIGO 4.06 Primer Analysis software (National 
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Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in 
length, to have a GC content of about 50% or more, and to anneal to the template at tenq}eratures of 
about 68**C to IT'C. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 

5 size-selected to include larger cDNAs. Li addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
into S' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 

10 the size or confirm the nucleotide sequence of seiquencing or PGR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer), and the entire process 

15 from loading of samples to computer analysis and electronic data display may be computer 

controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sanq>le. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof 
which encode HSECP may be cloned in recombinant DNA molecules that direct expression of 

20 HSECP, or fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent 
degeneracy of the genetic code, other DNA sequences which encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and used to express HSECP. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter HSECP-encoding sequences for a variety of purposes including, but 

25 not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PGR reassembly of gene fragments and synthetic 
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis miay be used to introduce mutations that create new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

30 The nucleotides of the present invention may be subjected to DNA shuffling techniques such 

as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 
improve the biological properties of HSECP, such as its biological or enzymatic activity or its ability 

35 to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
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variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 

5 breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby' 
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 

10 manner. 

In another embodiment, sequences encoding HSECP may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids 
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) 
Alternatively, HSECP itself or a fragment thereof may be synthesized using chemical methods. For 

15 example, peptide synthesis can be performed using various solid-phase techniques. (See, e.g., 

Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using the 
ABI 431 A peptide synthesizer (Perkin-Elmer). Additionally, the amino acid sequence of HSECP, or 
any part thereof, may be altered during direct synthesis and/or combined with sequences from other 
proteins, or any part thereof, to produce a variant polypeptide. 

20 The peptide may be substantially purified by preparative high performance liquid 

chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392^21.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by 
sequencing. (See, e.g., Creighton, T. (1984) Proteins. Structures and Molecular Properties . WH 
Freeman, New York NY.) 

25 In order to express a biologically active HSECP, the nucleotide sequences encoding HSECP 

or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which 
contains the necessary elements for transcriptional and translational control of the inserted coding 
sequence in a suitable host. These elements include regulatory sequences, such as enhancers, 
constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in 

30 polynucleotide sequences encoding HSECP, Such elements may vary in their strength and 

specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding HSECP. Such signals include the ATG initiation codon and adjacent sequences, 
e.g. the Kozak sequence. In cases where sequences encoding HSECP and its initiation codon and 
upstream regulatory sequences are inserted into the appropriate expression vector, no additional 

35 transcriptional or translational control signals may be needed. However, in cases where only coding 
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sequence, or a fragment thereof* is inserted, exogenous translational control signals including an in- 
frame ATG initiation codon should be provided by the vector. Exogenous translational elements and 
initiation codons may be of various origins, both natural and synthetic. The efficiency of expression 
may be enhanced by the inclusion of enhancers appropriate for the particular host cell system used. 

5 (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 20: 125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding HSECP and appropriate transcriptional and translational 
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 
and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning. A 

10 Laboratory Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et 
al. (1995) Current Protocols in Molecular Bioloev . John Wiley & Sons, New York NY, ch. 9, 13, and 
16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding HSECP. These include, but are not limited to, microorganisms such as bacteria transformed 

15 with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, 
or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems. The invention is not limited by the host cell employed. 

20 In bacterial systems, a number of cloning and expression vectors may be selected depending 

upon the use intended for polynucleotide sequences encoding HSECP. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding HSECP can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La JoUa CA) or PSPORTl 
plasmid (Life Technologies). Ligation of sequences encoding HSECP into the vector's multiple 

25 cloning site disrupts the lac2» gene, allowing a colorimetric screening procedure for identiflcation of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
nested deletions in the cloned sequence. (See, e.g.. Van Heeke, G. and SAl. Schuster (1989) J. Biol. 
Chem. 264:5503-5509.) When large quantities of HSECP are needed, e.g. for the production of 

30 antibodies, vectors which direct high level expression of HSECP may be used. For example, vectors 
containing the strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of HSECP. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 

35 vectors direct either the secretion or intracellular retention of expressed proteins and enable 
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integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 
1995, supra : Bitter. G.A. et al. (1987) Methods En2ymol. 153:516-544; and Scorer, C.A. et al. (1994) 
Bio/Technology 12:181-184.) 

Plant systenis may also be used for expression of HSECP. Transcription of sequences 
encoding HSECP may be driven viral promoters, e.g., the 35S and 19S promoters of ClaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-31 1). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters may be used. (See, e.g., Comzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. 
(1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) 
These constructs can be introduced into plant cells by direct DNA transformation or 
pathogen-mediated transfection. (See, e.g.. The McGraw Hill Yearbook of Science and Technolopv 
(1992) McGraw Hill, New York NY, pp. 191-196.) 

In manunalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding HSECP may be ligated into 
an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective vims which expresses HSECP in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J J. et al. (1997) Nat. Genet. 
15:345-355.) 

For long term production of recombinant proteins in manunalian systems, stable expression 
of HSECP in cell lines is preferred. For example, sequences encoding HSECP can be transformed 
uito cell lines using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate vector. 
Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in 
enriched media before being switched to selective media. The purpose of the selectable marker is to 
confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
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include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk and apr ceils, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 1 1:223-232; Lowy. I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, 
or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to 
5 methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat 
confer resistance to chlorsulfiiron and phosphinotricin acetyltransferase, respectively. (See, e.g., 
Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and./tijD, which 
alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 

10 Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins 
(GFP; Clontech), B glucuronidase and its substrate fi-glucuronide, or luciferase and its substrate 
luciferin may be used. These markers can be used not only to identify transfonmants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific vector system. 
(See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.) 

15 Although the presence/absence of marker gene expression suggests that the gene of interest is 

also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding HSECP is inserted within a marker gene sequence, transformed cells containing 
sequences encoding HSECP can be identified by the absence of marker gene function. Alternatively, 
a marker gene can be placed in tandem with a sequence encoding HSECP under the control of a 

20 single promoter. Expression of the marker gene in response to induction or selection usually 
indicates expression of the tandem gene as well. 

Li general, host cells that contain the nucleic acid sequence encoding HSECP and that express 
HSECP may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 

25 amplification, and protein bioassay or inununoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of HSECP using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 

30 fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on HSECP is preferred, but a 
competitive bmding assay may be employed. These and other assays are well known in the art. (See. 
e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual . APS Press, St. Paul MN, 
Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology . Greene Pub. Associates and 

35 Wiley-Interscience, New York NY; and Pound, J.D. ( 1 998) Immunochemical Protocols . Humana 

30 : 
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Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding HSECP 
5 include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Altematively, the sequences encoding HSECP, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety 

10 of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for 
ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic 
agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding HSECP may be cultured under 

15 conditions suitable for the expression and recovery of the protein from cell culture. The protein 

produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors 
containing polynucleotides which encode HSECP may be designed to contain signal sequences which 
direct secretion of HSECP through a prokaryotic or eukaryotic cell membrane. 

20 In addition, a host cell strain may be chosen for its ability to modulate expression of the 

inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypeptide include, but arc not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or 
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 

25 Different host cells which have specific cellular machinery and characteristic mechanisms for 

post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modiflcation and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 

30 sequences encoding HSECP may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chiineric HSECP protein 
containing a heterologous moiety that can be recognized by a conunercially available antibody may' 
facilitate the screening of peptide libraries for inhibitors of HSECP activity. Heterologous protein 
and peptide moieties may also facilitate purification of fusion proteins using commercially available 

35 affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
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maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, 
c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their 
cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and 
metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity 
5 purification of fusion proteins using commercially available monoclonal and polyclonal antibodies 
that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a 
proteolytic cleavage site located between the HSECP encoding sequence and the heterologous protein 
sequence, so that HSECP may be cleaved away from the heterologous moiety following purification. 
Methods for fusion protein expression and purification are discussed in Ausubel (1995, supra , ch. 10). 
10 A variety of commercially available kits may also be used to facilitate expression and purification of 
fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled HSECP may be achieved 
in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 
systems couple transcription and translation of protein-coding sequences operably associated with the 

15 T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
precursor, for example, ^^S-methionine. 

Fragments of HSECP may be produced not only by recombinant means, but also by direct 
peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra, pp. 55-60.) Protein 
synthesis may be performed by manual techniques or by automation. Automated synthesis may be 

20 achieved, for example, using the ABI 43 1 A peptide synthesizer (Perkin-Elmer). Various fragments of 
HSECP may be synthesized separately and then combined to produce the full length molecule. 
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of HSECP and human secretory proteins. In addition, the expression of HSECP is 

25 closely associated with cancer, inflammation, and gastrointestinal, cardiovascular, and neurological 
disorders. Therefore, HSECP appears to play a role in cancer, inflammation, and gastrointestinal, 
cardiovascular, and neurological disorders. In the treatment of disorders associated with increased 
HSECP expression or activity, it is desirable to decrease the expression or activity of HSECP. In the 
treatment of disorders associated with decreased HSECP expression or activity, it is desirable to 

30 increase the expression or activity of HSECP. 

Therefore, in one embodiment, HSECP or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of HSECP. Examples of such disorders include, but are not limited to, a cancer such as 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 

35 particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
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bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, 
parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus ; an 
inflammatory disorder such as acquired immunodeflciency syndrome (AIDS), Addison's disease, 
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
5 atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmime 

polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, eiythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout. Graves' disease, Hashimoto's 

10 thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, 
psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic 
anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative 
colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 

15 circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; a 
gastrointestinal disorder such as dysphagia, peptic esophagitis, esophageal spasm, esophageal 
stricture, esophageal carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, 
emesis, gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal 
obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, 

20 pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, 
passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, 
Crohn's disease, Whijpple's disease, Mallory-Weiss syndrome, colonic carcinoma, colonic 
obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal 
hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic 

25 encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease, alpha 
antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein 
obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno- 
occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of 
pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas; a 

30 cardiovascular disorder, and in particular, a disorder of the heart such as congestive heart failure, 
ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, 
degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, 
mitral annular calcification, mitral valye prolapse, rheumatic fever and rheumatic heart disease, 
infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus 

35 erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart 
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disease, congenital heart disease, and complications of cardiac transplantation; and a neurological 
disorder such as epilepsy » ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease. Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal 
disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural 

5 muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 

10 nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial 
nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 
peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 

15 endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, anuiesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, and Tourette* s disorder. 

In another embodiment, a vector capable of expressing HSECP or a fragment or derivative 

20 thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of HSECP including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
HSECP in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat 
or prevent a disorder associated with decreased expression or activity of HSECP including, but not 

25 limited to, those provided above. 

In still another embodiment, an agonist which modulates the activity of HSECP may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of HSECP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of HSECP may be administered to a subject to treat or 

30 prevent a disorder associated with increased expression or activity of HSECP. Examples of such 
disorders include, but are not limited to, those cancer, inflammation, and gastrointestinal, 
cardiovascular, and neurological disorders described above. In one aspect, an antibody which 
specifically binds HSECP may be used directly as an antagonist or indirectly as a targeting or delivery 
mechanism for bringing a pharmaceutical agent to cells or tissues which express HSECP. 

35 In an additional embodiment, a vector expressing the complement of the polynucleotide 
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encoding HSECP may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of HSECP including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 
5 therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 
by one of ordinary skill in the art, according to conventional pharmaceutical principles. The 
combination of therapeutic agents may act synergistically to effect the treatment or prevention of the 
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

10 An antagonist of HSECP may be produced using methods which are generally known in the 

art. In particular, purifled HSECP may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind HSECP. Antibodies to HSECP may 
also be generated using methods that are well known in the art. Such antibodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies. Fab fragments, and 

15 fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be inmiunized by injection with HSECP or with any fragment or oligopeptide thereof 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 

20 increase immunological response. Such adjuvants include, but are not limited to, Freund^s, mineral 
gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in 
humans, BCG (bacilli Calmette-Guerin) and Corvnebacterium parvum are especially preferable. 
It is preferred that the oligopeptides, peptides, or fiagments used to induce antibodies to 

25 HSECP have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fi-agments are identical to a portion of the amino acid sequence of the natural protein and contain the 
entire amino acid sequence of a small, naturally occurring molecule. Short stretches of HSECP 
amino acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric 

30 molecule may be produced. 

Monoclonal antibodies to HSECP may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 

35 Immunol. Methods 81:31-42; Cote. R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
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Cole, S.R et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition^ techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
5 Natl. Acad. Sci. USA 81 :6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce HSECP-specific single 
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial inununoglobulin libraries. (See, e.g., 

10 Burton, D.R. (1991) Proc, Natl. Acad, Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening inununoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature. (See, e.g., Oriandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

15 Antibody fragments which contain specific binding sites for HSECP may also be generated. 

For example, such fragments include, but are not limited to, F(ab*)2 fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 

20 et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or inununoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
inununoassays typically involve the measurement of complex formation between HSECP and its 

25 specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering HSECP epitopes is generally used, but a competitive binding assay 
may also be employed (Pound, supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techniques may be used to assess the affinity of antibodies for HSECP. Affinity is expressed as an 

30 association constant, Kg, which is defined as the molar concentration of HSECP-antibody complex 
divided by the molar concentrations of firee antigen and free antibody under equilibrium conditions. 
The Kg determined for a preparation of polyclonal antibodies, which are heterogeneous in their 
affinities for multiple HSECP epitopes, represents the average affinity, or avidity, of the antibodies 
for HSECP. The determined for a preparation of monoclonal antibodies, which are monospecific 

35 for a particular HSECP epitope, represents a true measure of affinity. High-affinity antibody 
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preparations with ranging from about 10^ to 10*^ L/mole are preferred for use in inmmnoassays in 
which the HSECP-antibody complex must withstand rigorous manipulations. Low-affinity antibody 
preparations with ranging from about 10^ to 10^ L/mole are preferred for use in 
immunopurification and similar procedures which ultimately require dissociation of HSECP, 
5 preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical 
Approach . IRL Press, Washington, DC; Liddell, J.E. and Cryer, A. (1991) A Practical Guide to 
Monoclonal Antibodies . John Wiley & Sons, New Yoric NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
determine the quality and suitability of such preparations for certain downstream applications. For 

10 example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 

preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation 
of HSECP-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and 
guidelines for antibody quality and usage in various applications, are generally available. (See, e.g., 
Catty, supra , and Coligan et al. supra .) 

15 In another embodiment of the invention, the polynucleotides encoding HSECP, or any 

fi-agment or complement thereof, may be used for therapeutic purposes. In one aspect, the 
complement of the polynucleotide encoding HSECP may be used in situations in which it would be 
desirable to block the transcription of the mRNA. In particular, cells may be transformed with 
sequences complementary to polynucleotides encoding HSECP. Thus, complementary molecules or 

20 fragments may be used to modulate HSECP activity, or to achieve regulation of gene function. Such 
technology is now well known in the art, and sense or antisense oligonucleotides or larger fragments 
can be designed from various locations along the coding or control regions of sequences encoding 
HSECP. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or 
25 from various bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted 
organ, tissue, or cell population. Methods which are well known to those skilled in the art can be 
used to construct vectors to express nucleic acid sequences complementary to the polynucleotides 
encoding HSECP. (See, e.g., Sambrook, supra ; Ausubel, 1995, supra .) 

Genes encoding HSECP can be turned off by transforming a cell or tissue with expression 
30 vectors which express high levels of a polynucleotide, or fragment thereof, encoding HSECP. Such 
constructs may be used to introduce untranslatable sense or antisense sequences into a cell. Even in 
the absence of integration into the DNA, such vectors may continue to transcribe RNA molecules 
until they are disabled by endogenous nucleases. Transient expression may last for a month or more 
with a non-replicating vector, and may last even longer if appropriate replication elements are part of 
35 the vector system. 
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As mentioned above, modifications of gene expression can be obtained by designing 
complementary sequences or antisense molecules G^NA, RNA, or PNA) to the control, S', or 
regulatory regions of the gene encoding HSECP. Oligonucleotides derived from the transcription 
initiation site, e.g., between about positions -10 and +10 from the start site, may be employed. 

5 Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix 

pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently for 
the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic 
advances using triplex DNA have been described in the literature. (See, e.g.. Gee, J.E. et al. (1994) in 
Huber, B.E. and B.L Carr, Molecular and Immunologic Approaches. Futura Publishing, Mt. Klsco 

10 NY, pp. 163-177.) A complementary sequence or antisense molecule may also be designed to block 
translation of mRNA by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze, the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 

15 engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding HSECP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: QUA, 
GUU, and GUC. Once identified, short RNA sequences of between IS and 20 ribonucleotides, 

20 corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 

25 by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 
for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding HSECP. Such DNA sequences may be incorporated into a wide variety of 
vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA 

30 constructs that synthesize complementary RNA, constitutively or inducibly, can be introduced into 
cell lines, cells, or tissues. 

RNA moleculesjmay be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3* 
ends of the molecule, or the use of phosphorothioate or 2'O-methyl rather than phosphodiesterase 

35 linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 



38 



wo 00/52151 PCT/USOO/05621 

and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, 
queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, 
cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. 

5 Many methods for introducing vectors into cells or tissues are available and equally suitable 

for use in vivo , in vitro, and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 

10 Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a pharmaceutical 

15 or sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any of the 
therapeutic effects discussed above. Such pharmaceutical compositions may consist of HSECP, 
antibodies to HSECP, and mimetics, agonists, antagonists, or inhibitors of HSECP. The compositions 
may be administered alone or in combination with at least one other agent, such as a stabilizing 
compound, which may be administered in any sterile, biocompatible pharmaceutical carrier including, 

20 but not limited to, saline, buffered saline, dextrose, and water. The compositions may be administered 
to a patient alone, or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 
intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 

25 enteral, topical, sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may contain suitable 
pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate processing 
of the active compounds into preparations which can be used pharmaceutically. Further details on 
techniques for formulation and administration may be found in the latest edition of Reminpon's 

30 Pharmaceutical Sciences (Maack Publishing, Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration. 
Such carriers enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, 
capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the patient. 

35 Pharmaceutical preparations for oral use can be obtained through combining active 
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compounds with solid excipient and processing the resultant mixture of granules (optionally, after 
grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable 
excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, 
and sorbitol; starch from com, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose, 

5 hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including arabic and 

tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating or solubilizing agents 
may be added, such as the cross-linked polyvinyl pyrrolidone, agar, and alginic acid or a salt thereof, 
such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar 

10 solutions, which niay also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, polyethylene 
glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. 
Dyestuffs or pigments may be added to the tablets or dragee coatings for product identification or to 
characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 

15 gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or 
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft capsules, 
the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid, or 
liquid polyethylene glycol with or without stabilizers. 

20 Pharmaceutical formulations suitable for parenteral administration may be formulated in 

aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution. Ringer's 
solution, or physiologically buffered saline. Aqueous injection suspensions may contain substances 
which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Additionally, suspensions of the active compounds may be prepared as appropriate oily 

25 injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils, such as sesame oil, 
or synthetic fatty acid esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic 
amino polymers may also be used for delivery. Optionally, the suspension may also contain suitable 
stabilizers or agents to increase the solubility of the compounds and allow for the preparation of 
highly concentrated solutions. 

30 For topical or nasal administration, penetrants appropriate to the particular barrier to be 

permeated are used in the formulation. Such penetrants are generally known in the art. 

The pharmaceutical compositions of the present invention may be manufactured in a manner 
that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 

35 The pharmaceutical composition. may be provided as a salt and can be formed with many 
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acids» including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and succinic 
acids. Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding 
free base forms. In other cases, the preparation may be a lyophilized powder which may contain any 
or all of the following: 1 mM to 50 mM histidine, 0.1% to 2% sucrose, and 2% to 7% mannitol, at a 
5 pH range of 4.5 to 5.5, that is combined with buffer prior to use. 

After pharinaceutical compositions have been prepared, they can be placed in an appropriate 
container and labeled for treatment of an indicated condition. For administration of HSECP, such 
labeling would include amount, frequency, and method of administration. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein 

10 the active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, or pigs. 
An animal model may also be used to determine the appropriate concentration range and route of 

15 administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
HSECP or fragments thereof, antibodies of HSECP, and agonists, antagonists or inhibitors of HSECP, 
which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined 

20 by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LDs^/EDjo ratio. Pharmaceutical compositions 
which exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and 

25 animal studies are used to formulate a range of dosage for human use. The dosage contained in such 
compositions is preferably within a range of circulating concentrations that includes the ED50 with 
little or no toxicity. The dosage varies within this range depending upon the dosage form employed, 
the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 

30 subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 
response to therapy. Long-acting pharmaceutical compositions may be administered every 3 to 4 

35 days, every week, or biweekly depending on the half-life and clearance rate of the particular 
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Normal dosage amounts may vary from about 0.1 to 100,000 ;zg, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
5 Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind HSECP may be used for the 

10 diagnosis of disorders characterized by expression of HSECP, or in assays to monitor patients being 
treated with HSECP or agonists, antagonists, or inhibitors of HSECP. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above.for therapeutics. 
Diagnostic assays for HSECP include methods which utilize the antibody and a label to detect 
HSECP in human body fluids or in extracts of cells or tissues. The antibodies may be used with or 

15 without modification, and may be labeled by covalent or non-covalent attachment of a reporter 

molecule. A wide variety of reporter molecules, several of which are described above, are known in 
the art and may be used. 

A variety of protocols for measuring HSECP, including ELISAs, RIAs, and FACS, are known 
in the art and provide a basis for diagnosing altered or abnormal levels of HSECP expression. 

20 Normal or standard values for HSECP expression are established by combining body fluids or cell 
extracts taken from normal manunalian subjects, for example, human subjects, with antibody to 
HSECP under conditions suitable for complex formation. The amount of standard complex formation 
may be quantitated by various methods, such as photometric means. Quantities of HSECP expressed 
in subject, control, anjd disease samples from biopsied tissues are compared with the standard values. 

25 Deviation between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding HSECP may be used 
for diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 
and quantify gene expression in biopsied tissues in which expression of HSECP may be correlated 

30 with disease. The diagnostic assay may be used to determine absence, presence, and excess 

expression of HSECP, and to monitor regulation of HSECP levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding HSECP or closely related molecules may be used 
to identify nucleic acid sequences which encode HSECP. The specificity of the probe, whether it is 

35 made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
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conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding HSECP, allelic variants, or related 
sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 

5 sequence identity to any of the HSECP encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:23-44 or from 
genomic sequences including promoters, enhancers, and introns of the HSECP gene. 

Means for producing specific hybridization probes for DNAs encoding HSECP include the 
cloning of polynucleotide sequences encoding HSECP or HSECP derivatives into vectors for the 

10 production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes n^ay be labeled by a 
variety of reporter groups, for example, by radionuclides such as ^ or ^^S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

IS Polynucleotide sequences encoding HSECP may be used for the diagnosis of disorders 

associated with expression of HSECP. Examples of such disorders include, but are not limited to, a 
cancer such as adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, 
teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, 
breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, 

20 pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and. 
uterus; an inflammatory disorder such as acquired immunodeficiency syndrome (AIDS), Addison's 
disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, 
asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 

25 dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 

episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout. Graves' disease, Hashimoto's 
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, 

30 psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic 

anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative 
colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; a 
gastrointestinal disorder such as dysphagia, peptic esophagitis, esophageal spasm, esophageal 

35 stricture, esophageal carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, 
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emesis, gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal 
obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, 
pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, 
passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, 
5 Crohn's disease, Whipple's disease, Mallory- Weiss syndrome, colonic carcinoma, colonic 

obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal 
hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic 
encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease, alpha ,- 
antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein 

10 obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno- 
occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of 
pregnancy, and hepatic tumors including nodular hyperplasias; adenomas, and carcinomas; a 
cardiovascular disorder, and in particular, a disorder of the heart such as congestive heart faihire, 
ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, 

15 degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, 
mitral annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease, 
infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus 
erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart 
disease, congenital heart disease, and complications of cardiac transplantation; and a neurological 

20 disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease. Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal 
disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural 
muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 

25 suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome, fatal familial insomnia^ nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 

30 nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial 
nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 
peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 
endocrine,, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, 

35 catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
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neuralgia, and Tourette's disorder. The polynucleotide sequences encoding HSECP may be used in 
Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR technologies; 
in dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing fluids or tissues 
from patients to detect altered HSECP expression. Such qualitative or quantitative methods are well 
5 known in the art. 

In a particular aspect, the nucleotide sequences encoding HSECP may be useful in assays that 
detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding HSECP may be labeled by standard methods and added to a fluid or tissue 
sample from a patient under conditions suitable for the formation of hybridization complexes. After a 

10 suitable incubation period, the sample is washed and the signal is quantified and compared with a 
standard value. If the amount of signal in the patient sample is significantly altered in comparison to 
a control sample then the presence of altered levels of nucleotide sequences encoding HSECP in the 
sample indicates the presence of the associated disorder. Such assays may also be used to evaluate 
the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to 

15 monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
HSECP, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding HSECP, under conditions suitable for hybridization or 

20 amplification. Standard hybridization may be quantified by comparing the values obtained from 
normal subjects with values fix)m an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained in this manner may be compared with values 
obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 
values is used to establish the presence of a disorder. 

25 Once the presence of a disorder is established and a treatment protocol is initiated, 

hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

30 With respect to cancer, the presence of an abnormal amount of transcript (either under- or 

overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals 
to employ preventative measures or aggressive treatment earlier thereby preventing the development 

35 or further progression of the cancer. 
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Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
HSECP may involve the use of PGR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide 
encoding HSECP, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
5 HSECP, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

Methods which may also be used to quantify the expression of HSECP include radiolabeling 
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
10 standard curves. (See, e.g., Melby. P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer of interest is 
presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 

15 In further embodiments, oligonucleotides or longer fragments derived from.any of the 

polynucleotide sequences described herein may be used as targets in a microarray. The microarray 
can be used to monitor the expression level of large numbers of genes simultaneously and to identify 
genetic variants, mutations, and polymorphisms. This information may be used to determine gene 
function, to understand the genetic basis of a disorder, to diagnose a disorder, and to develop and 

20 monitor the activities of therapeutic agents. 

Microarrays may be prepared, used, and analjfzed using methods known in the art. (See, e.g.. 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93: 10614-10619; Baldeschweiler et al. (1995) PCT application W095/251 1 16; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, RA. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 

25 2155; and Heller, M,J. et al. (1997) U.S. Patent No. 5,605,662.) 

In another embodiment of the invention, nucleic acid sequences encoding HSECP may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
The sequences may be mapped to a particular chromosome, to a specific region of a chromosome, or 
to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), yeast artificial 

30 chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI constructions, or single 
chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355; Price, 
CM. (1993) Blood Rev. 7:127-134; and Trask, BJ. (1991) Trends Genet. 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome 
mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra . 

35 pp. 965-968.) Examples of genetic map data can be found in various scientific journals or at the 
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Online Mendelian Inheritance in Man (OMIM) World Wide Web site. Correlation between the 
location of the gene encoding HSECP on a physical chromosomal map and a specific disorder, or a 
predisposition to a specific disorder, may help deflne the region of DNA associated with that 
disorder. The nucleotide sequences of the invention may be used to detect differences in gene 
5 sequences among normal, carrier, and affected individuals. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 
may reveal associated nmrkers even if the number or arm of a particular human chromosome is not 

10 known. New sequences can be assigned to chromosomal arms by physical mapping. This provides 
valuable information to investigators searching for disease genes using positional cloning or other 
gene discovery techniques. Once the disease or syndrome has been crudely localized by genetic 
linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any sequences mapping 
to that area may represent associated or regulatory genes for further investigation. (See, e.g., Gatti, 

15 R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the subject invention may also 
be used to detect differences in the chromosomal location due to translocation, inversion, etc., among 
normal, carrier, or affected individuals. 

In another embodiment of the invention, HSECP, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 

20 screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
between HSECP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 

25 application WO84/03S64.) In this method, large numbers of different small test compounds are 

synthesized on a solid substrate. The test compounds are reacted with HSECP, or fragments thereof, 
and washed. Bound HSECP is then detected by methods well known in the art. Purified HSECP can 
also be coated directly onto plates for use in the aforementioned drug screening techniques. 
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 

30 solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding HSECP specifically compete with a test compound for binding HSECP. 
In this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
antigenic determinants with HSECP. 
35 In additional embodiments, the nucleotide sequences which encode HSECP may be used in 
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any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
5 description, utilize the present invention to its fullest extent. The following preferred specific 

embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

Without further elaboration, it is believed diat one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following preferred specific 
10 embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, in 
particular U.S. Ser. No. 60/123,1 17, are hereby expressly incorporated by reference. 

15 EXAMPLES 
I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized 
and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a 

20 monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifiiged 
over CsCl cushions or extracted with chloroform. RNA was precipitated from the lysates with either 
isopropanol or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 

25 using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly firom tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 

30 libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 

35 appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300- 
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1000 bp) using SEPHACRYL SIOOO, SEPHAROSE CL2B. or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
PBLUESCRIFT plasmid (Stratagene), PSPORTl plasmid (Life Technologies), pcDNA2.1 plasmid 

5 (Invitrogen, Carlsbad CA), or pINCY plasmid (Incyte Pharmaceuticals, Palo Alto CA). Recombinant 
plasmids were transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or 
SOLR from Stratagene or DH5a, DHIOB, or ElectroMAX DHIOB from Life Technologies, 
n. Isolation of cDN A Clones 

Plasmids were recovered from host cells by in vivo excision using the UNEZAP vector system 

10 (Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: a Magic or 
WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep purification kit (Edge 
Bipsystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QL\WELL 8 Plus Plasmid, QIAWELL 8 
Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid purification kit from QIAGEN. 
Following precipitation, plasmids were resuspended in 0.1 ml of distilled water and stored, with or 

15 without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 

20 using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN H fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 
III. Sequencing and Analysis 

cDNA sequencing reactions were processed using standard methods or high-throughput 
instramentation such as the ABI CATALYST 800 (Perkin-Elmer) thermal cycler or the PTG-200 

25 thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) 
or the MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were 
prepared using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing 
kits such as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 

30 were carried out using the MEGABACE lOCX) DNA sequencing system (Molecular Dynamics); the 
ABI PRISM 373 or 377 sequencing system (Perkin-Elmer) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art. Reading 
frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 
1997, supra , unit 7.7). Some of the cDNA sequences were selected for extension using the techniques 

35 disclosed in Example V. 
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The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable 
descriptions, references, and threshold parameters. The first column of Table S shows the tools, 
5 programs, and algorithms used, the second column provides brief descriptions thereof, the third 
column presents appropriate references, all of which are incorporated by reference herein in their 
entirety, and the fourth column presents, where applicable, the scores, probability values, and other 
parameters used to evaluate the strength of a match between two sequences (the higher the score, the 
greater the homology between two sequences). Sequences were analyzed using MACDNASB PRO 
10 software (Hitachi Software Engineering. South San Francisco CA) and LASERGENE software 

(DNASTAR). Polynucleotide and polypeptide sequence alignments were generated using the default 
parameters specified by the clustal algorithm as incorporated into the MEGALIGN multisequence 
alignment program (DNASTAR), which also calculates the percent identity between aligned 
sequences. 

15 The polynucleotide sequences were validated by removing vector, linker, and polyA 

sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programing, and dinucleotide nearest neighbor analysis. The sequences were then queried 
against a selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, 
and eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire 

20 annotation using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled 
into full length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and 
were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. 
The full length polynucleotide sequences were translated to derive the corresponding full length 
aniind acid sequences, and these full length sequences were subsequently analyzed by querying 

25 against databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
DOMO, PRODOM, Prosite, and Hidden Markov Model (HMM)-based protein family databases such 
as PFAM. HMM is a probabilistic approach which analyzes consensus primary structures of gene 
families. (See, e.g., Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 

The programs described above for the assembly and analysis of full length polynucleotide 

30 and amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:23-44. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies were described in The Invention section above. 
IV. Northern Analysis 

Northern analysis is a laboratory, technique used to detect the presence of a transcript of a 

35 gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
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from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel, 
1995, supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in nucleotide databases such as GenBank or LIFESEQ (Incyte Pharmaceuticals). This 
5 analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of 
the computer search can be modified to determine whether any particular match is categorized as 
exact or similar. The basis of the search is the product score, which is defined as: 

% sequence identitv x % maximum BLAST score 
100 

10 The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. For example, with a product score of 40, the match will be exact 
within a 1 % to 2% error, and, with a product score of 70, the match will be exact. Similar molecules 
are usually identified by selecting those which show product scores between IS and 40, although 
lower scores may identify related molecules. 

15 The results of northern analyses are reported as a percentage distribution of libraries in which 

the transcript encoding HSECP occurred. Analysis involved the categorization of cDNA libraries by 
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 

20 cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in 
Tables. 

v. Extension of HSECP Encoding Polynucleotides 

25 The full length nucleic acid sequences of SEQ ID NO:23-44 were produced by extension of 

an appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other 
primer, to initiate 3' extension of the icnown fragment. The initial primers were designed using 
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 

30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 
sequence at temperatures of about SS^'C to about 72°C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

35 High fidelity amplification was obtained by PCR using methods well known in the art. PGR 
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was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg^*, 0^4)2804, 
and P-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfij DNA polymerase (Stratagene), with the following parameters for primer 

5 pair PCI A and PCI B: Step 1 : 94*C, 3 min; Step 2: 94**C, 1 5 sec; Step 3: eO^'C, 1 min; Step 4: 68**C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 ^'C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94 *C, 3 min; Step 2: 
94 15 sec; Step 3: 57°C, 1 min; Step 4: GS^'C. 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68**C, 5 min; Step 7: storage at 4''C. 

16 The concentration of DNA in each well was determined by dispensing 100 |il PICOGREEN 

quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 ^1 of undiluted PCR product into each well of an opaque fluorimeter plate (Coming Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 

15 concentration of DNA. A 5 //I to 10 //l aliquot of the reaction mixture was analyzed by 

electrophoresis on a 1 % agarose mini-gel to determine which reactions were successful in extending 
the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 

20 sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones 
were religated using T4 ligase (New England Biolabs, Beveriy MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fiU-in restriction site 

25 overhangs, and transfected into competent E. coli cells. Transformed cells were selected on 

antibiotic-containing media, individual colonies were picked and cultured overnight at 37 ''C in 384- 
well plates in L6/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfii DNA polymerase (Stratagene) with the following 

30 parameters: Step 1: 94 ""C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60^C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2. 3, and 4 repeated 29 times; Step 6: 72''C, 5 min; Step 7: storage at 4*'C. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA 
recoveries were reamplified using the same conditions as described above. Samples were diluted 
with 20% dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing 

35 primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM 

52 



wo 00/52151 



PCT/USOO/05621 



BIGDYE Tenninator cycle sequencing ready reaction kit (Perkin-Elmer). 

In like manner, the nucleotide sequences of SEQ ID NO:23-44 are used to obtain S' 
regulatory sequences using the procedure above, oligonucleotides designed for such extension, and an 
appropriate genomic library. 

5 VI. Labeling and Use of Individual Hybridization Probes 

Hybridization probes- derived from SEQ ID NO:23-44 are employed to screen cDNAs, 
genomic DNAs» or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 

10 software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 /iCi of 
[y.^^P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
An aliquot containing 10^ counts per minute of the labeled probe is used in a typical membrane-based 

15 hybridization analysis of human genomic DNA digested with one of the following endonucleases: 
Ase I, Bgl n. Eco RI, Pst I, Xba I, or Pvu n (DuPont MEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40*^0. To remove nonspecific signals, blots are sequentially washed at room temperature 

20 under conditions of up to, for example, 0. 1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

VII. Microarrays 

A chemical coupling procedure and an Inkjet device can be used to synthesize array 
25 elements on the surface of a substrate. (See, e.g., Baldeschweiler, supra .) An array analogous to a 
dot or slot blot may also be used to arrange and link elements to the surface of a substrate using 
thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced by hand 
or using available methods and machines and contain any appropriate number of elements. After 
hybridization, nonhybridized probes are removed and a scanner used to determine the levels and 
30 patterns of fluorescence. The degree of complementarity and the relative abundance of each probe 
which hybridizes to an element on the microarray may be assessed through analysis of the scanned 
images. 

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may comprise 
the elements of the microarray. Fragments suitable for hybridization can be selected using software 
35 well known in the art such as LASERGENE software (DNASTAR). Full-length cDNAs, ESTs, or 



53 



wo 00/52151 



PCTAJS00/05d21 



fragments thereof corresponding to one of the nucleotide sequences of the present invention, or 
selected at random from a cDNA library relevant to the present invention, are arranged on an 
appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., UV cross-linking 
followed by thennal and chemical treatments and subsequent drying. (See, e.g., Schena, M. et al. 

5 (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.) Fluorescent probes 
are prepared and used for hybridization to the elements on the substrate. The substrate is analyzed by 
procedures described above. 
Vin. Complementary Polynucleotides 

Sequences complementary to the HSECP-encoding sequences, or any parts thereof, are used 

10 to detect, decrease, or inhibit expression of naturally occurring HSECP. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of HSECP. To 
inhibit transcription, a complementary oligonucleotide is designed from the most unique 5* sequence 

15 and used to prevent promoter binding to the coding sequence. To inhibit translation, a 

complementary oligonucleotide is designed to prevent ribosomal binding to the HSECP-encoding 
transcript. 

IX. Expression of HSECP 

Expression and purification of HSECP is achieved using bacterial or virus-based expression 

20 systems. For expression of HSECP in bacteria, cDNA is subcloned into an appropriate vector 

containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 

25 Antibiotic resistant bacteria express HSECP upon induction with isopropyl beta-D- 

thiogalactopyranoside (IPTG). Expression of HSECP in eukaryotic cells is achieved by infecting 
insect or mammalian cell lines with recombinant Autoeraphica califomica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding HSECP by either homologous recombination or bacterial-mediated 

30 transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. 
et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 

35 7:1937-1945.) 
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. In most expression systems, HSECP is synthesized as a fusion protein with, e.g., glutathione 
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
aifmity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26- 
kilodalton enzyme from Schistosoma iaponicum , enables the purification of fusion proteins on 

5 immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from 
HSECP at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity 
purification using conunercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman 
Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate 

10 resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, 
supra, ch. 10 and 16). Purified HSEC!P obtained by these methods can be used directly in the 
following activity assay. 
X. Demonstration of HSECP Activity 

An assay for HSECP activity measures the expression of HSECP on the cell surface. cDNA 

IS encoding HSECP is subcloned into an appropriate mammalian expression vector suitable for high 
levels of cDNA expression. The resulting construct is transfected into a nonhuman cell line such as 
NIH3T3. Cell surface proteins are labeled with biotin using methods known in the art. 
Immunoprecipitations are performed using HSECP-specific antibodies, and immunoprecipitated 
samples are analyzed using SDS-PAGE and immunoblotting techniques. The ratio of labeled 

20 immunoprecipitant to unlabeled immunoprecipitant is proportional to the amount of HSECP 
expressed on the cell surface. 

Alternatively, an assay for HSECP activity measures the amount of HSECP in secretory, 
membrane-bound organelles. Transfected cells as described above are harvested and lysed. The 
lysate is fractionated using methods known to those of skill in the art, for example, sucrose gradient 

25 ultracentrifiigation. Such methods allow the isolation of subcellular components such as the Golgi 
apparatus, ER, small membrane-bound vesicles, and other secretory, organelles. 
Immunoprecipitations from fractionated and total cell lysates are performed using HSECP-specific 
antibodies, and immunoprecipitated samples are analyzed using SDS-PAGE and immunoblotting 
techniques. The concentration of HSECP in secretory organelles relative to HSECP in total cell 

30 lysate is proportional to the amount of HSECP in transit through the secretory pathway. 
XL Functional Assays 

HSECP function is assessed by expressing the sequences encoding HSECP at 
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a 
mammalian expression vector containing a strong promoter that drives high levels of cDNA 

35 expression. Vectors of choice include pCMV SPORT plasmid (Life Technologies) and pCR3.1 
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plasmid (Invitrogen), both of which contain the cytomegalovirus promoter. 5-10 //g of recombinant 
vector are transiently transfected into a human cell line, for example, an endothelial or hematopoietic 
cell line, using either liposome formulations or electroporation. 1-2 /ig of an additional plasmid 
containing sequences encoding a marker protein are co-transfected. Expression of a marker protein 

5 provides a means to distinguish transfected cells from nontransfected cells and is a reliable predictor 
of cDNA expression from the recombinant vector. Marker proteins of choice include, e.g., Green 
Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), 
an automated, laser optics-based technique, is used to identify transfected cells expressing GFP or 
CD64-GFP and to evaluate the q)optotic state of the cells and other cellular properties. FCM detects 

10 and quantifies the uptake of fluorescent molecules that diagnose events preceding or coincident with 
cell death. These events include changes in nuclear DNA content as measured by staining of DNA 
with propidium iodide; changes in cell size and granularity as measured by forward light scatter and 
90 degree side light scatter; down-regulation of DNA synthesis as measured by decrease in 
bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular proteins as 

15 measured by reactivity with specific antibodies; and alterations in plasma membrane composition as 
measured by the binding of fluorescein-conjugated Annexin V protein to the cell surface. Methods in 
flow cytometry are discussed in Ormerod, M.G, (1994) Flow Cvtometrv > Oxford, New York NY. 

The influence of HSECP on gene expression can be assessed using highly purified 
populations of cells transfected with sequences encoding HSECP and either CD64 or CD64-GFP. 

20 CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions 
of human immunoglobulin G (IgG). Transfected cells are efTiciently separated from nontransfected 
cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake 
Success NY). mRNA can be purified from the cells using methods well known by those of skill in 
the art. Expression of mRNA encoding HSECP and other genes of interest can be analyzed by 

25 northern analysis or microarray techniques. 

XIL Production of HSECP Specific Antibodies 

HSECP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. ( 1 990) Methods Enzymol. 1 82:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

30 Alternatively, the HSECP amino acid sequence is analyzed using LASERGENE software 

(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel, 1995, supra , ch. 11.) 

35 Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
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peptide synthesizer (Perldn-EImer) using fmoc-chemistiy and coupled to KLH (Sigma-Aldrich, St. 

Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccininiide ester (MBS) to increase 

inununogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are inununized with the oligopeptide- 

KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti- 
5 HSECP activity by, for example, binding the peptide or HSECP to a substrate, blocking with 1% 

' BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. . 

Xm. Purification of Naturally Occurring HSECP Using Specific Antibodies 

Naturally occurring or recombinant HSECP is substantially purified by immunoaffmity 

chromatography using antibodies specific for HSECP. An immunoafOnity column is constructed by 
10 covalently coupling anti-HSECP antibody to an activated chromatographic resin, such as 

CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 

blocked and washed according to the manufacturer's instructions. 

Media containing HSECP are passed over the immunoaffmity colunm, and the column is 

washed under conditions that allow the preferential absorbance of HSECP (e.g., high ionic strength 
15 buffers in the presence of detergent). The column is eluted under conditions that disrupt 

antibody/HSECP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 

as urea or thiocyanate ion), and HSECP is collected. 

XIV. Identification of Molecules Which Interact with HSECP 

HSECP, or biologically active fragments thereof, are labeled with "^I Bolton*Hunter 
20 reagent. (See, e.g., Bolton A,E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate 

molecules previously arrayed in the wells of a multi-well plate are incubated with the labeled HSECP, 

washed, and any wells with labeled HSECP complex are assayed. Data obtained using different 

concentrations of HSECP are used to calculate values for the number, affinity, and association of 

HSECP with the candidate molecules. 
25 Alternatively, molecules interacting with HSECP are analyzed using the yeast two-hybrid 

system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 

available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

Various modifications and variations of the described methods and systems of the invention 
30 will be apparent to those skilled in the art without departing from the scope and spirit of the 

invention. Although the invention has been described in connection with certain embodiments, it 
should be understood that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out the invention 
which are obvious to those skilled in molecular biology or related fields are intended to be within the 
35 scope of the following claims. 
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Vector 1 


PBLUESCRIPT 
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PSPORTl II 


PSPORTl 




pINCY 


pINCY 


pINCY 


Disease or Condition 
(Fraction of Total) 


Inflammation (0.750) 
Cancer (0.250) 


Cancer (0.476) 
Inflammation (0.333) 


Cancer (0.608) 

Inflammation (0.196) 

Cell Proliferation (0.118) 


Cancer (0.600) 

Cell Proliferation (0.178) 

Inflammation (0.133) 


Cancer (0.667) Cell 
Proliferation (0.133) 
Inflammation (0.089) 


Cancer (0.452) 
Inflammation (0 .205) 
Trauma (0.164) 


Trauma (0.600) 
Cancer (0.200) 
Inflammation (0.200) 


Cancer (0.521) 

Inflammation (0.207) 

Cell Proliferation (0.172) 


Tissue Expression 
(Fraction of Total) 


Musculoskeletal (0.750) 
Reproductive (0 .250) 


Reproductive (0.333) 
Musculoskeletal (0.190) 
Cardiovascular (0. 143) 


Reproductive (0.314) 
Nervous (0.235) 
.Gastrointestinal (0.157) 


Reproductive (0.333) 
Nervous (0.178) 
Cardiovascular (0.156) 


Reproductive (0 . 333 ) 
Cardiovascular (0.244) 
Gastrointestinal ( 0 . Ill ) 


Nervous (0 .301) 
Reproductive (0.219) 
Gastrointestinal ( 0 . 137 ) 


Gastrointestinal (0.800) 
Nervous (0.200) 


Reproductive (0.249) 
Nervous (0.195) 
Gastrointestinal (0.136) 


Selected Fragment (s) 
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What is claimed is: 
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1 . An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-22, and 

10 d) an immunogenic fragment of an amino acid sequence selected from the group consisting 

of SEQ ID NO: 1-22. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID N0:1- 

3. An isolated polynucleotide encoding a polypeptide of claim 1 . 

4. An isolated polynucleotide of claim 3 selected from the group consisting of SEQ ID 
NO:23-44. 

20 

5. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

6. A cell transformed with a recombinant polynucleotide of claim 5. 

25 

7. A transgenic organism comprising a recombinant polynucleotide of claim 5. 

8. A method for producing a polypeptide of claim 1 , the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
30 cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 

comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of 
claim 1, and 

b) recovering the polypeptide so expressed. 



22. 

15 



35 



9. An isolated antibody which specifically binds to a polypeptide of claim 1. 

75 
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10. An isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, 

b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 
5 polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a)-d). 

10 11. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 

polynucleotide of claim 10. 

12. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 10, the method comprising: 

15 a) hybridizing the sample with a probe comprising at least 16 contiguous nucleotides 

comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 

20 present, the amount thereof. 

13. A method of claim 12, wherein the probe comprises at least 30 contiguous nucleotides. 

14. A method of claim 12, wherein the probe comprises at least 60 contiguous nucleotides. 

25 

15. A pharmaceutical composition comprising an effective amount of a polypeptide of claim 
1 and a pharmaceutically acceptable excipient. 

16. A method for treating a disease or condition associated with decreased expression of 
30 functional HSECP, comprising administering to a patient in need of such treatment the 

pharmaceutical composition of claim 15. 



35 



17. A method for screening a compound for efiTectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 
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b) detecting agonist activity in the sample. 

18. A phannaceutical composition comprising an agonist compound identified by a method 
of claim 17 and a phannaceutically acceptable excipient. 

19. A method for treating a disease or condition associated with decreased expression of 
functional HSECP, comprising administering to a patient in need of such treatment a pharmaceutical 
composition of claim 18. 

20. A method for screening a compound for effectiveness as an antagonist of a polypeptide 
of claim 1 , the method comprising: ' 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

21. A pharmaceutical composition comprising an antagonist compound identified by a 
method of claim 20 and a pharmaceutically acceptable excipient. 

22. A method for treating a disease or condition associated with overexpression of functional 
HSECP, comprising administering to a patient in need of such treatment a pharmaceutical 
composition of claim 21. 

23. 'A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 4, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 
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<110> INCYTE PHARMACEUTICALS, INC. 
TANG, Y. Tom 
LAL, Preeti 
. BAUGHN, Mariah R. 
YUE, ^Henry 
AU- YOUNG, Janice 
LU, Dyung Aina M. 
AZIM2AI, Yalda 

<120> HUMAN SECRETORY PROTEINS 



<130> PF-0675 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/123,117 
<151> 1999-03-05 

<160> 44 

<170> PERL Program 

<210> 1 
<211> 182 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 078811CD1 



<400> 1 














Met 


Arg 


Ser 


Thr 


He 


Leu 


Leu 


Phe 


1 








5 








Ser 


Leu 


Pro 


Val 


Phe 
20 


Pro 


Ser 


Leu 


Met 


Leu 


Thr 


Leu 


Gly 


Pro Asp 


Leu 










35 








Gly 


Met 


Thr 


Pro 


Gly 
50 


Thr 


Gin 


Thr 


Leu 


Asn 


Val 


Gin 


Gin 
65 


Gin 


Leu 


His 


Val 


Thr 


Gin 


Leu 


Gly 
80 


Ala 


Pro 


Gly 


He 


Ala 


Thr 


Asn 


Leu 
95 


His 


Glu 


Pro 


Arg 


Glu 


Ala 


Ser 


Leu 
110 


Pro 


Thr 


Ser 


Val 


Gin 


Asp Gly 


Ser 


Leu 


Pro 


Ala 










125 








Ala 


Thr 


Gin Gly 


Thr 


Pro Ala Gly 










140 








Thr 


Asp 


Asp Asp 


Phe 


Ala 


Val 


Thr 










155 








Ser 


Thr 


His 


Ala 


He 
170 


Glu 


Glu 


Ala 


He 


Gin 















Cys 


Leu 


Leu 


Gly Ser 


Thr Arg 




10 








15 


Ser 


Leu 


He 


Pro 


Leu 


Thr Gin 




25 








30 


His 


Leu 


Leu 


Asn 


Pro 


Ala Ala 




40 








45 


His 


Pro 


Leu 


Thr 


Leu 


Gly Gly 




55 








60 


Pro 


His 


Val 


Leu 


Pro 


He Phe 




70 








75 


His 


Tyr 


Pro 


Lys 


Leu 


Arg Gly 




85 








90 


His 


His 


Pro 


Phe 


Leu 


Val Pro 




100 








105 


Gin 


Ala 


Gly 


Ala. 


Asn 


Pro Asp 




115 








120 


Gly Gly 


Ala 


Gly Val 


Asn Pro 




130 








135 


Arg Leu 


Pro 


Thr 


Pro 


Ser Gly 




145 








150 


Thr 


Pro 


Ala 


Gly He 


Gin Arg 




160 








165 


Thr 


Thr 


Glu 


Ser 


Ala 


Asn Gly 




175 








180 



<210> 2 
<211> 125 
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<212> PRT 

<213> Homo sapiens 
<220> - 

<221> misc_feature 

<223> Incyte ID No: 371156CD1 

<400> 2 



Met 


Val 


cys Glu 


Asp 


Ala 


Pro Ser Phe Gin Met 


Ala 


Trp 


Glu 


Ser 


T 
a. 










10 








15 


Gin 


Met 


Ala Trp 


Glu 
A u 


Arg 


Gly Pro Ala Leu Leu 
25 


Cys 


Cys 


Val 


Leu 
30 


Ser 


Ala 


Ser Gin 


Leu 


Ser 


Ser Gin Asp Gin Asp 


Pro 


Leu Gly His 








35 




40 








45 


He 


Lys 


Ser Leu 


Leu 
50 


Tyr 


Pro Phe Gly Phe Pro 
55 


Val 


Glu 


Leu 


Pro 
60 


Arg 


Pro 


Gly Pro 


Thr 


Gly Ala Tyr Lys Lys Val Lys 


Asn 


Gin 


Asn 








65 




70 








75 


Gin 


Thr 


Thr Ser 


Ser 


Glu 


Leu Leu Arg Lys Gin Thr 


Ser 


His 


Phe 








80 




85 








90 


Asn 


Gin 


Arg Gly 


His 


Arg Ala Arg Ser Lys Leu 


Leu 


Ala 


Ser Arg 








95 




100 








105 


Gin 


He 


Pro Asp 


Arg 
110 


Thr 


Phe Lys Cys Gly Lys 
115 


Trp 


Leu 


Pro 


Gin 
120 


Val 


Pro 


Ser Pro 


Val 
125 















<210> 3 
<211> 320 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 584050CD1 

<400> 3 



Met 


Ala 


Gly 


Leu 


Ala 


Ala 


Arg 


Leu Val Leu Leu 


Ala 


Gly Ala Ala 


1 








5 






10 








15 


Ala 


Leu 


Ala 


Ser 


Gly 


Ser 


Gin 


Gly Asp Arg Glu 


Pro 


Val 


Tyr Arg 










20 






25 








30 


Asp 


Cys 


Val 


Leu 


Gin 


Cys 


Glu 


Glu Gin Asn Cys 


Ser 


Gly Gly Ala 










35 






40 








45 


Leu 


Asn 


His 


Phe 


Arg 


Ser 


Arg 


Gin Pro He Tyr 


Met 


Ser 


Leu 


Ala 










50 






55 








60 


Gly 


Trp 


Thr 


Cys 


Arg 


Asp 


Asp 


Cys Lys Tyr Glu 


Cys 


Met 


Trp 


Val 










65 






70 








75 


Thr 


Val 


Gly 


Leu 


Tyr 


Leu 


Gin 


Glu Gly His Lys 


Val 


Pro 


Gin 


Phe 










80 






85 








90 


His 


Gly 


Lys 


Trp 


Pro 


Phe 


Ser 


Arg Phe Leu Phe 


Phe 


Gin 


Glu 


Pro 










95 






100 








105 


Ala 


Ser 


Ala 


Val 


Ala 


Ser 


Phe 


Leu Asn Gly Leu Ala 


Ser 


Leu 


Val 










110 






115 








120 


Met 


Leu 


Cys 


Arg 


Tyr 


Arg 


Thr 


Phe Val Pro Ala 


Ser 


Ser 


Pro 


Met 










125 






130 








135 


Tyr 


His 


Thr 


Cys 


Val 


Ala 


Phe 


Ala Trp Val Ser 


Leu 


Asn 


Ala 


Trp 










140 






145 








150 


Phe 


Trp 


Ser 


Thr 


Val 


Phe 


His 


Thr Arg Asp Thr Asp 


Leu 


Thr 


Glu 










155 






160 








165 


Lys 


Met 


Asp 


Tyr 


Phe 


Cys 


Ala 


Ser Thr Val He 


Leu 


His 


Ser 


He 










170 






175 








180 


Tyr 


Leu 


Cys 


Cys 


Val 


Arg 


Thr 


Val Gly Leu Gin His 


Pro 


Ala 


Val 










185 






190 








195 


Val 


Ser 


Ala 


Phe 


Arg 


Ala 


Leu 


Leu Leu Leu Met 


Leu 


Thr 


Val 


His 
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Val 


Ser 


Tyr 


Leu Ser 


Leu He Arg Phe 


Val 


Ala 


Asn 


Val Ala 

Z J u 


He Gly Leu Val 


Ala 


Trp 


Cys 


Leu Trp 


Asn Gin Arg Arg 


Cys 


Val 


Val 


Val Val 
9 Afi 


Leu Leu Leu Gin 


Leu 


Leu 


Asp 


Phe Pro 
97^ 


Pro Leu Phe Trp 


He 


Trp 


His 


He Ser 
290 


Thr He Pro Val 


Phe 


Leu 


Glu 


Asp Asp 
305 


Ser Leu Tyr Leu 


Lys 


Phe 


Lys 


Leu Asp 
320 





<210> 4 
<211> 234 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 863808CD1 

<400> 4 



Met 
1 


Gly 


Pro 


Gly 


Gly 
5 


Arg Val Ala Arg 


Trp 


Arg 


Arg 


Ala 


Val 


Ser Ser Val Ala 










20 




Glu 


Pro 


Gly 


Leu 


Arg 


Leu Leu Ala Val 










35 




Ala 


Ala 


Phe 


Cys 


Arg 


Ala Cys Gin Thr 










50 




Leu 


His 


Ser 


Glu 


Pro 


Gly Leu Glu Glu 










65 


Asn 


Glu 


Gly 


Arg 


Pro 


Glu Ser Asp Ala 










80 




Lys 


Phe 


Asp 


lie 


Asp 


Met Met Val Ser 










95 




Ala 


Arg 


Asp 


He 


Cys 


Val He Gin Val 










110 




Thr 


Asp 


Tyr 


Phe 


Val 


He Val Ser Gly 










125 


His 


Ala 


Met 


Ala 


Phe 


Tyr Val Val Lys 










140 


Cys 


Lys 


Arg 


Asp 


Pro 


His Val Lys He 










155 


Asp 


Trp 


Leu 


Cys 


Val 


Asp Phe Gly Ser 










170 




Leu 


Pro 


Glu 


Thr 


Arg 


Glu He Tyr Glu 










185 




Leu 


Arg 


Ser 


Tyr 


Asp 


Asp Gin Leu Ala 










200 




Val 


Pro 


Glu 


Asp 


Phe 


He Leu Gly He 










215 


Val 


Thr 


Pro 


val 


Glu 


Leu Lys Cys Glu 



230 
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2x0 


Asp 


ryr Ka±y Tyr Asn 


Leu 


990 




0 0 R 


Asn 


vax vax ixp xrp 


Leu 








Leu 


fro nis vax Arg 


Lys 






ADD 




T.Alt T aiy T All 

ucu oer Xreu Xteu 


laXU 


265 




A f yj 


Val 


Leu Asp Ala His 


Ala 


280 




285 


His 


Val Leu Phe Phe 


Ser 


295 




300 


Leu 


Lys Glu Ser Glu 


Asp 


310 




315 



Leu 


Leu Ala Pro Leu 


Met 


10 




15 


Gly 


Ser Ala Val Gly 


Ala 


25 




30 


Gin 


Arg Leu Pro Val 


Gly 


40 




45 


Pro 


Asn Phe Val Arg 


Gly 


55 




60 


Arg 


Ala Glu Gly Thr 


Val 


70 




75 


Ala 


Asp His Thr Gly 


Pro 


85 




90 


Leu 


Leu Arg Gin Glu 


Asn 


100 




105 


Pro 


Pro Glu Met Arg 


Tyr 


115 




120 


Thr 


Ser Thr Arg His 


Leu 


130 




135 


Met 


Tyr Lys His Leu 


Lys 


145 




150 


Glu 


Gly Lys Asp Thr 


Asp 


160 




165 


Met 


Val He His Leu 


Met 


175 




180 


Leu 


Glu Lys Leu Trp 


Thr 


190 




195 


Gin 


He Ala Pro Glu 


Thr 


205 




210 


Glu 


Asp Asp Thr Ser 


Ser 


220 




225 
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<210> 5 
<211> 278 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc^f eature 

<223> Incyte ID No: 978433CD1 

<400> 5 

Met Gin Pro Ala Ala Ala Ser Glu Arg Gly Gly Ala Asp Ala Asp 
15 10 15 

His Val Pro Leu Leu Gly Leu Leu Arg Leu Gin Leu Arg Ala Ala 
20 25 30 

Arg Gin Pro Gly Ala Met Arg Pro Gin Gly Pro Ala Ala Ser Pro 
35 40 45 

Gin Arg Leu Arg Gly Leu Leu Leu Leu Leu Leu Leu Gin Leu Pro 
50 55 • 60 

Ala Pro Ser Ser Ala Ser Glu lie Pro Lys Gly Lys Gin Lys Ala 
65 70 75 

Gin Leu Arg Gin Arg Glu Val Val Asp Leu Tyr Asn Gly Met Cys 
80 85 90 

Leu Gin Gly Pro Ala Gly Val Pro Gly Arg Asp Gly Ser Pro Gly 
95 100 ■ 105 

Ala Asn Gly lie Pro Gly Thr Pro Gly He Pro Gly Arg Asp Gly 

110 115 120 

Phe Lys Gly Glu Lys Gly Glu Cys Leu Arg Glu Ser Phe Glu Glu 

125 130 135 

Ser Trp Thr Pro Asn Tyr Lys Gin Cys Ser Trp Ser Ser Leu Asn 

140 145 150 

Tyr Gly He Asp Leu Gly Lys He Ala Glu Cys Thr Phe Thr Lys 

155 160 165 

Met Arg Ser Asn Ser Ala Leu Arg Val Leu Phe Ser Gly Ser Leu 

170 175 180 

Arg Leu Lys Cys Arg Asn Ala Cys Cys Gin Arg Trp Tyr Phe Thr 

185 190 195 

Phe Asn Gly Ala Glu Cys Ser Gly Pro Leu Pro He Glu Ala He 

200 205 210 

He Tyr Leu Asp Gin Gly Ser Pro Glu Met Asn Ser Thr He Asn 

215 220 225 

He His Arg Thr Ser Ser Val Glu Gly Leu Cys Glu Gly He. Gly 

230 235 240 

Ala Gly Leu Val Asp* Val Ala He Trp Val Gly Thr Cys Ser Asp 

245 250 255 

Tyr Pro Lys Gly Asp Ala Ser Thr Gly Tirp Asn Ser Val Ser Arg 

260 265 270 

He He He Glu Glu Leu Pro Lys 

275 



<210> 6 
<211> 136 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc^f eature 

<223> Incyte ID No: 16553 69CD1 

<400> 6 

Met Pro Pro Gly Gly Leu Gly Ala Cys Ala Val Thr Pro Ala Pro 
15 10 15 

Gly Glu Glu Arg Thr Gin Pro Gly Glu Leu Gly Gin Gly Leu His 
20 25 30 

Met Ala Gin Gly Gin. Gin Met Leu Ala Gly Gin Leu Leu Pro Met 



4/28 



wo 00/52151 







35 




T 1 ^^^^ ^ 


ijeu. ijeu 


Pro 


Pro Ser Pne Pro 






DU 




- 

GJ.y Piro 


Ax y iuTCi 


HIS 


Ala Ser Leu Tnr 






OD 




izjp fiec 


/u.a 1 jrp 




Arg Pro Trp Ala 






oU 




Pto Leu 


i*xy \3rxn 


Leu 


Trp Lys Ser Ser 






95 




Ala Ala 


Trp Leu 


Gin 


Pro Leu Ala Leu 






110 




Ala Ser 


Ala Leu 


Ser 


Ala Leu Gly Thr 






125 


Gin 









<210> 7 
<211> 109 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1703244CD1 

<400> 7 



Met 


Leu 


Met 


Tyr 


Met 


Phe 


Tyr 


Val Leu 


1 








5 








Ala 


Tyr 


Ala 


Leu 


Thr 
20 


Phe 


Pro 


Gly Cys 


Ala 


Leu 


Val 


Phe 


Ala 
35 


Gly 


Gly 


lie Gly 


Met 


Gly 


Ala 


Ser 


Met 


His 


Leu 


Arg Thr 










50 






Pro 


Glu 


Asp 


Thr 


Trp 
65 


Gly 


Cys 


Phe Phe 


Ala 


Leu 


Gly 


Pro 


His 
80 


Leu 


Leu 


Ala Tyr 


Ala 


Phe 


Phe 


His 


Gin 
95 


Pro 


Pro 


Pro Ser 


Lys 


Lys 


Gin 


His 











<210> 8 
<211> 262 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1730819CD1 

<400> 8 

Met Ala Ala Ala Ser Ala Gly Ala Thr 

1 ' 5 

Leu Met Ala Val Ala Ala Pro Ser Arg 
20 

Arg Ala Gly Thr Gly Ala Arg Gly Ala 
35 

Gly Glu Ala Cys Gly Thr Val Gly Leu 
50 

Glu lie Asp Asp Ser Ala Asn Phe Arg 
65 
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40 






45 


Leu 


Pro 


His Pro Thr 


Leu 


33 






60 


Varin 


Leu 


Gly Pro Ala 


Pne 








7b 


His 


Leu 


Gly Pro Gly 


Gin 








90 


Val 


Glu 


Glu His Leu 


Leu 


100 






105 


Leu 


Glu 


Trp Ser Leu 


Gly 


115 






120 


Ser 


His 


Pro Leu Gly 


Leu 


130 




135 



Pro 


Phe 


Cys 


Gly Leu 


Ala 


10 








15 


Ser 


Trp 


Leu 


Pro Asp 


Trp 


25 








30 


Gin 


Ala 


Gin 


Phe Ser 


His 


40 








45 


Pro 


Phe 


Thr 


Tyr Arg 


Val 


55 








60 


Val 


Cys 


Asn 


Leu Leu 


Tyr 


70 








75 


Arg 


Cys 


Leu 


Gin Trp 


Pro 


85 








90 


Asp 


Pro 


Leu 


Ala Leu 


His 


100 








105 



Arg 


Leu 


Leu Leu Leu 


Leu 


10 






15 


Ala 


Arg 


Gly Ser Gly 


Cys 


25 






30 


Gly 


Ala 


Glu Gly Arg 


Glu 


40 






45 


Leu 


Leu 


Glu His Ser 


Phe 


55 






60 


Lys 


Arg 


Gly Ser Leu 


Leu 


70 






75 
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Trp 


Asn 


Gin 


Gin 


Asp 


Gly 


Thr 


Ser 


Glu 


Glu 


Glu 


Arg 


Gly 


Arg 


Gly 


Leu 


Tyr 


Arg 


Val 
1 1 n 

J. J.U 


Arg 


He 


Gly 


Leu 


Glu 


Ala 


Gly 


Gly 


Tyr 


Ser 


Leu 


Val 


Glu 


Ser 


His 


Leu 


Asp 


Val 


Ala 


Gly 


Asn 

±33 


Val 


Val 


Gly 


Gly 


Cys 


Arg 


Gly 

± / U 


His 


Glu 


Phe 


Asn 


Thr 


Ser 


Val 

±oD 


Gin 


Leu 


Pro 




Thr 


Ala 


o nn 


iriie 


Tl o 


Gin 


Lys 


Ala 


Lys 


Asn 
215 


Pro 


Gin 


Tyr 


Trp 


Met 


Tyr 


He 
230 


He 


Pro 


Gly 


Ala 


Pro Asp 


Thr 


Gly 


Gly 










245 






Gly 


Gly 


Gly Gly 


Ser 


Gly 


Arg 



260 



Leu 


Ser Leu 


Ser Gin Arg Gin 


Lieu 




c53 






on 


Leu 


ivtg /isp 


vax Axa Axa 


Leu 


Asn 




1 nn 
X uu 






XU3 


Pro 


rvi y Arg 


irro vjiy AX a 


Leu 


Asp 




1 1 *^ 
X X3 






ion 
xz u 


vd J. 


Ser Ser 


Phe Val Pro 


Ala 


Cys 




1 "^0 
X 






X^D 




Asp u xn 


Leu Thr Leu 


His 


vax 




Xft3 






X3U 


m vr 


vox oer 


Val Val Thr 


His 


Pro 




Xou 






Xb3 




oXU ASp 


Val Asp Leu Glu 


Leu 




1 7t; 

X / 3 






1 on 
loU 


vvxn 


Pro Pro 


Thr Thr Ala 


Pro 


Giy 




x?u 






IOC 
X73 


ox u 


jjcu 


Glu Met Glu 


Gin 


AX a 










^XU 


Glu 


Gin Lys 


Ser Phe Phe 


Ala 


Lys 




220 






225 


Val 


Val Leu 


Phe Leu Met 


Met 


Ser 




235 






240 


Gin 


Gly Gly 


Gly Gly Gly Cys 


Gly 




250 






255 



<210> 9 
<211> 384 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> xnisc_feature 

<223> Incyte ID No: 1757161CD1 

<400> 9 



Met 


Ala 


Glu 


Gin 


Thr^ 


' Tyr 


Ser 


Trp 


Ala 


Tyr 


Ser Leu Val Asp 


Ser 


1 








5 










10 


15 


Ser 


Gin 


Val 


Ser 


Thr 


Phe 


Leu 


He 


Ser 


He 


Leu Leu He Val 


Tyr 










20 










25 




30 


Gly 


Ser 


Phe 


Arg 


Ser 


Leu 


Asn 


Met 


Asp 


Phe 


Glu Asn Gin Asp 


Lys 










35 










40 


45 


Glu 


Lys 


Asp 


Ser 


Asn 


Ser 


Ser 


Ser 


Gly Ser 


Phe Asn Gly Asn 


Ser 










50 










55 




60 


Thr 


Asn 


Asn 


Ser 


He 


Gin 


Thr 


He 


Asp 


Ser 


Thr Gin Ala Leu 


Phe 










65 










70 




75 


Leu 


Pro 


He 


Gly 


Ala 


Ser 


Val 


Ser 


Leu 


Leu 


Val Met Phe Phe 


Phe 










80 










85 




90 


Phe 


Asp 


Ser 


Val 


Gin 


Val 


Val 


Phe 


Thr 


He 


Cys Thr Ala Val 


Leu 










95 










100 




105 


Ala 


Thr 


He 


Ala 


Phe 


Ala 


Phe 


Leu 


Leu 


Leu 


Pro Met Cys Gin 


Tyr 










110 










115 


120 


Leu 


Thr 


Arg 


Pro 


Cys 


Ser 


Pro 


Gin 


Asn 


Lys 


He Ser Phe Gly 


Cys 










125 










130 




135 


Cys 


Gly 


Arg 


Phe 


Thr 


Ala 


Ala 


Glu 


Leu 


Leu 


Ser Phe Ser Leu 


Ser 










140 










145 




150 


Val 


Met 


Leu 


Val 


Leu 


He 


Trp 


Val 


Leu 


Thr 


Gly His Trp Leu 


Leu 










155 










160 


165 


Met 


Asp 


Ala 


Leu 


Ala 


Met 


Gly 


Leu 


Cys 


Val 


Ala Met He Ala 


Phe 










170 










175 




180 


Val 


Arg 


Leu 


Pro 


Ser 


Leu 


Lys 


Val 


Ser 


Cys 


Leu Leu Leu Ser 


Gly 










185 










190 




195 


Leu 


Leu 


He 


Tyr 


Asp 


Val 


Phe 


Trp 


Val 


Phe 


Phe Ser Ala Tyr 


He 
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200 205 210 

Phe Asn Ser Asn Val Met Val Lys Val Ala Thr Gin Pro Ala Asp 

215 220 225 

Asn Pro Leu Asp Val Leu Ser Arg Lys Leu His Leu Gly Pro Ash 

230- 235 240 

Val Gly Arg Asp Val Pro Arg Leu Ser Leu Pro Gly Lys Leu Val 

245 250 255 

Phe Pro Ser Ser Thr Gly Ser His Phe Ser Met Leu Gly lie Gly 

260 265 270 

Asp lie Val Met Pro Gly Leu Leu Leu Cys Phe Val Leu Arg Tyr 

275 280 285 

Asp Asn Tyr Lys Lys Gin Ala Ser Gly Asp Ser Cys Gly Ala Pro 

290 295 300 

Gly Pro Ala Asn lie Ser Gly Arg Met Gin Lys Val Ser Tyr Phe 

305 310 315 

His Cys Thr Leu lie Gly Tyr Phe Val Gly Leu Leu Thr Ala Thr 

320 325 330 

Val Ala Ser Arg lie His Arg Ala Ala Gin Pro Ala Leu Leu Tyr 

335 340 345 

Leu Val Pro Phe Thr Leu Leu Pro Leu Leu Thr Met Ala Tyr Leu 

350 355 360 

Lys Gly Asp Leu Arg Arg Met Trp Ser Glu Pro Phe His Ser Lys 

365 370 375 

Ser Ser Ser Ser Arg Phe Leu Glu Val 

380 



<210> 10 
<211> 244 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1976095CD1 

<400> 10 



Met 


Asp 


He 


Leu 


Val 


Pro 


Leu 


Leu 


Gin Leu Leu Val Leu Leu Leu 


1 








5 








10 15 


Thr 


Leu 


Pro 


Leu 


His 
20 


Leu 


Met 


Ala 


Leu Leu Gly Cys Trp Gin Pro 
25 30 


Leu 


Cys 


Lys 


Ser 


Tyr 
35 


Phe 


Pro 


Tyr 


Leu Met Ala Val Leu Thr Pro 
40 45 


Lys 


Ser 


Asn 


Arg 


Lys 
50 


Met 


Glu 


Ser 


Lys Lys Arg Glu Leu Phe Ser 
55 60 


Gin 


He 


Lys 


Gly 


Leu 
65 


Thr 


Gly 


Ala 


Ser Gly Lys Val Ala Leu Leu 
70 75 


Glu 


Leu 


Gly 


Cys 


Gly Thr 


Gly 


Ala 


Asn Phe Gin Phe Tyr Pro Pro 










80 








85 90 


Gly 


Cys 


Arg 


Val 


Thr 
95 


Cys 


Leu 


Asp 


Pro Asn Pro His Phe Glu Lys 
100 105 


Phe 


Leu 


Thr 


Lys 


Ser 
110 


Met 


Ala 


Glu 


Asn Arg His Leu Gin Tyr Glu 
115 120 


Arg 


Phe 


Val 


Val 


Ala 

125 


Pro 


Gly 


Glu 


Asp Met Arg. Gin Leu Ala Asp 
130 135 


Gly 


Ser 


Met 


Asp 


Val 
140 


Val 


Val 


Cys 


Thr Leu Val Leu Cys Ser Val 
145 150 


Gin 


Ser 


Pro 


Arg 


Lys 
155 


Val 


Leu 


Gin 


Glu Val Arg Arg Val Leu Arg 
160 165 


Pro 


Gly 


Gly 


Val 


Leu 
170 


Phe 


Phe 


Trp 


Glu His Val Ala Glu Pro Tyr 
175 180 


Gly 


Ser 


Trp 


Ala 


Phe 
185 


Met 


Trp 


Gin 


Gin Val Phe Glu Pro Thr Trp 
190 195 


Lys 


His 


He 


Gly 


Asp Gly 


Cys 


Cys 


Leu Thr Arg Glu Thr Trp Lys 



200 205 210 
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Asp Leu Glu Asn Ala Gin Phe Ser Glu lie Gin Met Glu Arg Gin 

215 220 225 

Pro Pro Pro Leu Lys Trp Leu Pro Val Gly Pro His lie Met Gly 

230 235 240 

Lys Ala Val Lys 



<210> 11 
<211> 326 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> inisc_feature 

<223> Incyte ID No: 2169991CD1 

<400> 11 



Met 


Arg 


Thr 


Glu 


Ala Gin Val 


Pro Ala Leu Gin 


Pro 


Pro 


Glu 


Pro 


1 








5 


10 








15 


Gly 


Leu 


Glu 


Gly 


Ala Met Gly 


His Arg Thr Leu 


Val 


Leu 


Pro 


Trp 










20 


25 








30 


Val 


Leu 


Leu 


Thr 


Leu Cys Val 


Thr Ala Gly Thr 


Pro 


Glu 


Vai 


Trp 










35 


40 








45 


Val 


Gin 


Val 


Arg 


Met Glu Ala 


Thr Glu Leu Ser 


Ser 


Phe 


Thr 


He 










50 


55 








60 


Arg 


Cys 


Gly 


Phe 


Leu Gly Ser 


Gly Ser He Ser 


Leu 


Val 


Thr 


Val 










. 65 


70 








75 


Ser 


Trp 


Gly 


Gly 


Pro Asn Gly 


Ala Gly Gly Thr 


Thr 


Leu 


Ala 


Val 










80 


85 








90 


Leu 


His 


Pro 


Glu 


Arg Gly He 


Arg Gin Trp Ala 


Pro 


Ala 


Arg 


Gin 










95 


100 






105 


Ala 


Arg 


Trp 


Glu 


Thr Gin Ser 


Ser He Ser Leu 


He 


Leu 


Glu Gly 










110 


115 








120 


Ser 


Gly 


Ala 


Ser 


Ser Pro Cys 


Ala Asn Thr Thr Phe Cys 


Cys 


Lys 












130 








135 


Phe 


Ala 


Ser 


Phe 


Pro Glu Gly 


Ser Tarp Glu Ala Cys Gly 


Ser 


Leu 










140 


145 








150 


Pro 


Pro 


Ser 


Ser 


Asp Pro Gly 


Leu Ser Ala Pro 


Pro 


Thr 


Pro 


Ala 










155 


160 








165 


Pro 


He 


Leu 


Arg 


Ala Asp Leu 


Ala Gly He Leu Gly Val 


Ser 


Gly 










170 


175 








180 


Val 


Leu 


Leu 


Phe 


Gly Cys Val 


Tyr Leu Leu His 


Leu 


Leu 


Arg Arg 










185 


190 








195 


His 


Lys 


His 


Arg 


Pro Ala Pro 


Arg Leu Gin Pro 


Ser Arg 


Thr 


Ser 










200 


205 








210 


Pro 


Gin 


Ala 


Pro 


Arg Ala Arg 


Ala Trp Ala Pro 


Ser 


Gin 


Ala 


Ser 










215 . 


220 








225 


Gin 


Ala 


Ala 


Leu 


His Val Pro 


Tyr Ala Thr He 


Asn 


Thr 


Ser 


Cys 










230 


235 








240 


Arg 


Pro 


Ala 


Thr 


Leu Asp Thr 


Ala His Pro His Gly Gly 


Pro 


Ser 










245 


250 








255 


Trp 


Trp 


Ala 


Ser 


Leu Pro Thr 


His Ala Ala His Arg Pro 


Gin Gly 










260 


265 








270 


Pro 


Ala 


Ala 


Trp 


Ala Ser Thr 


Pro He Pro Ala Arg Gly 


Ser 


Phe 










275 


280 








285 


Val 


Ser 


Val 


Glu 


Asn Gly Leu 


Tyr Ala Gin Ala Gly Glu 


Arg 


Pro 










290 


295 








300 


Pro 


His 


Thr 


Gly 


Pro Gly Leu 


Thr Leu Phe Pro Asp 


Pro 


Arg Gly 










305 


310 








315 


Pro 


Arg 


Ala 


Met 


Glu Gly Pro 


Leu Gly Val Arg 











320 325 
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<210> 12 
<211> 105 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_:f€ature 

<223> Incyte ID No: 2616827CD1 

<400> 12 



Met 


Asn 


Leu 


Gly 


Val 


Ser 


Met 


Leu Arg 


He Leu Phe Leu 


Leu Asp 


1 








5 








10 


15 


Val 


Gly 


Gly Ala 


Gin 


Val 


Leu 


Ala Thr 


Gly Lys Thr Pro Gly Ala 










20 








25 


30 


Glu 


lie 


Asp 


Phe 


Lys 


Tyr 


Ala 


Leu He 


Gly Thr Ala Val 


Gly Val 










35 








40 


45 


Ala 


lie 


Ser 


Ala 


Gly 


Phe 


Leu 


Ala Leu 


Lys He Cys Met 


He Arg 










50 








55 


60 


Arg 


His 


Leu 


Phe 


Asp Asp Asp 


Ser Ser 


Asp Leu Lys Ser 


Thr Pro 










65 








70 


75 


Gly 


Gly 


Leu 


Ser 


Asp 


Thr 


He 


Pro Leu 


Lys Lys Arg Ala 


Pro Arg 










80 








85 


90 


Arg 


Asn 


His 


Asn 


Phe 


Ser 


Lys Arg Asp 


Ala Gin Val He 


Glu Leu 










95 








100 


105 



<210> 13 
<211> 626 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2991370CD1 

<400> 13 

Met Ala Pro Ser Ala Asp Pro Gly Met Ser Arg Met Leu Pro Phe 
15 10 15 

Leu Leu Leu Leu Trp Phe Leu Pro He Thr Glu Gly Ser Gin Arg 
20 25 30 

Ala Glu Pro Met Phe Thr Ala Val Thr Asn Ser Val Leu Pro Pro 
35 40 45 

Asp Tyr Asp Ser Asn Pro Thr Gin Leu Asn Tyr Gly Val Ala Val 
50 55 60 

Thr Asp Val Asp His Asp Gly Asp Phe Glu He Val Val Ala Gly 
65 70 75 

Tyr Asn Gly Pro . Asn Leu Val Leu Lys Tyr Asp Arg Ala Gin Lys 
80 85 90 

Arg Leu Val Asn He Ala Val Asp Glu Arg Ser Ser Pro Tyr Tyr 
95 100 105 

Ala Leu Arg Asp Arg Gin Gly Asn Ala He Gly Val Thr Ala Cys 
110 115 120 

Asp He Asp Gly Asp Gly Arg Glu Glu He Tyr Phe Leu Asn Thr 
125 130 135 

Asn Asn Ala Phe Ser Gly Val Ala Thr Tyr Thr Asp Lys Leu Phe 
140 145 150 

Lys Phe Arg Asn Asn Arg Trp Glu Asp He Leu Ser Asp Glu Val 
155 160 165 

Asn Val Ala Arg Gly Val Ala Ser Leu Phe Ala Gly Arg Ser Val 
170 175 180 

Ala Cys Val Asp Arg Lys Gly Ser Gly Arg Tyr Ser He Tyr He 
185 190 195 

Ala Asn Tyr Ala Tyr Gly Asn Val Gly Pro Asp Ala Leu He Glu 
200 205 210 

Met Asp Pro Glu Ala Ser Asp Leu Ser Arg Gly He Leu Ala Leu 
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215 220 225 



Airs' 


Asp 


Val 


Ala 


Ala 


Glu 


Ala 


Gly Val 


Ser 


Lys 


Tyr 


Tnr 


Gly 


Gly 


















O "3 R 

Zoo 










0 >l A 






Val 


Ser 


vai 


Gly Pro 


lie Leu 


Ser 


Ser 


Ser 


Aia 


Ser 


ASp 


















0 A 










o c 

ZOD 




irXlC 


Cys 


Asp 


Asn 


Glu 


Asn 


Gly Pro 


Asn 


pne 


Leu 


Fne 


nlS 


Asn 










Z du 








Zoo 










OTA 
Z i\J 




v»iy 


Asp Gly 


inr 


Phe 


Val 


Asp Ala 


Ala 


Ala 


ser 


Ala 


Giy 


Val 










O CL 








ZOKJ 










o o c 
285 


Asp 


Asp 


Pro 


His 


Gin 


His Gly 


Arg Gly 


Val 


Ala 


Leu 


Ala 


Asp 


Pne 










O Q A 








O Q R 

ZyD 










"3 A A 
300 


Asn 


Arg 


Asp Gly 


Lys 


Val 


Asp 


xie vai 


Tyr 


Gly 


Asn 


Trp 


Asn 


Gly 


















OlU 










315 


Piro 




Arg 


Leu 


Tyr 


Leu 


Gin 


Met Ser 


Thr 


His 


Gly 


Lys 


Val 


Arg 










ion 








^ o c 
oZd 










^ O A 

330 


Jrlie 


Arg 


Asp 


He 


Ala 


Ser 


Pro 


Lys Phe 


Ser 


Met 


Pro 


Ser 


Pro 


Val 


















1 An 
J4U 










345 


AX'S 


inr 


Val 


He 


mil* V 

Tnr 


Ala 


Asp 


Phe Asp 


Asn 


Asp 


Gin 


Glu 


Leu 


Glu 










•J R A 








c c 
J DD 










3 60 


Tl A 




Phe 


Asn 


Asn 


He 


Ala 


Tyr Arg 


Ser 


Ser 


Ser 


Ala 


Asn 


Arg 










ODD 








370 










375 


L€U 




Arg 


Val 


lie 


Arg 


Arg 


Glu His 


Gly 


Asp 


Pro 


Leu 


He 


Glu 










Q Q n 
J oU 








O D C 

3 OD 










390 


Glu 


Leu 


Asn 


Pro 


Gly 


Asp 


Ala 


Leu Glu 


Pro 


Glu 


Gly 


Arg 


Gly 


Thr 










1 o c; 

J y D 








400 










405 


Gly 


Gly 


Val 


Val 


Thr 


Asp 


Phe 


Asp Gly 


Asp 


Gly 


Met 


Leu 


Asp 


Leu 










41U 








415 










420 


lie 


Leu 


Ser 


His 


Gly 


Glu 


Ser 


Met Ala 


Gin 


Pro 


Leu 


Ser 


Val 


Phe 










425 








430 










435 


Arg 


Gly 


Asn 


Gin 


Gly 


Phe 


Asn 


Asn Asn 


Trp 


Leu 


Arg 


Val 


Val 


Pro 










A A f\ 








A A C 

445 










450 


Arg 


Tnr 


Arg 


Phe 


Gly 


Ala 


Phe 


Ala Arg 


Gly 


Ala 


Lys 


Val 


Val 


Leu 










ii c tr 

455 








460. 










465 


Tyr 


Tiir 


Lys 


Lys 


Ser 


Gly Ala 


His Leu 


Arg 


He 


He 


Asp 


Gly 


Gly 










4 /O 








475 










480 


Ser 


Gly 


Tyr 


Leu 


Cys 


Glu 


Met 


Glu Pro 


Val 


Ala 


His 


Phe 


Gly 


Leu 










>l o c 
485 








A AA 

490 








495 


uiy 


Lys 


Asp 


Glu 


Ala 


Ser 


Ser 


Val Glu 


Val 


Thr 


Trp 


Pro 


Asp 


Gly 










c r» r\ 








505 










510 


Lys 


£36 u 


Val 


Ser 


Arg 


Asn 


Val 


Ala Ser 


Gly 


Glu 


Met 


Asn 


Ser 


Val 










DID 








C O A 










c o c 

525 






He 


Leu 


Tyr 


Pro Arg 


Asp Glu 


Asp 


Thr 


Leu 


Gin 


Asp 


Pro 










con 
D J U 








535 










540 


% 1 ss 

Alcl 


Pro 


Leu 


Glu 


Cys 


Gly Gin 


Gly Phe 


Ser 


Gin 


Gin 


Glu 


Asn 


Gly 










D4D 








CCA 

DDVJ 










555 


ca 

nlS 


Cys 


Met 


Asp 


Thr 


Asn Glu 


Cys He 


Gin 


Phe 


Pro 


Phe 


Val 


Cys 










560 


















A 

D / U 


Pro 


Arg 


Asp 


Lys 


Pro 


Val 


Cys 


Val Asn 


Thr 


Tyr 


Gly 


Ser 


Tyr 


Arg 










575 








580 










585 


Cys 


Arg 


Thr 


Asn 


Lys 


Lys 


Cys 


Ser Arg 


Gly 


Tyr 


Glu 


Pro 


Asn 


Glu 










590 








595 










600 


Asp 


Gly 


Thr 


Ala 


Cys 


Val Gly 


Trp Trp 


Ser 


Pro 


Val 


Leu 


Lys 


He 










605 








610 










615 


Val 


Thr 


Pro 


Gin 


Val 


Gly Lys 


Ser Leu 


Gly 


Pro 











620 625 



<210> 14 
<211> 296 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3031062CD1 
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<400> 14 



Vfot 




1 tp 


ixrp 


AXd 


Ser 


Ser 


Pro 










■J 








Phe 


Iji6U 


Leu 


Pro 


OCX 


Axa 


fil n 


ftl \r 










20 








Tim 


Lys 


Val 


Phe 


Tie 
xxc 




ai n 










35 










Glu 


Pro 


Cys 


Ser 


Ser 


Gin 












50 








He 


Glu 


Glu Asp 


Leu 


Thr 

X IIX 


«rX U 




















PltS Km 




Ala 


Glu 


V dx 


vox 


Arg 


Arg 










O \J 










X IIX 


Lys Asn 


Arg 




Tyr 


Arg 


















Cat* 

OCX 


Axg 


Cys 


Ser 


v»xy 


Val 


^21 11 
UXU 


nxs 










110 

X X V 












Pro 


Asp 






rieu 


Val 

vax 










125 








m n 


VCIX 


Pro 


Lys 


•Lrp 




m 11 


Pro 










X 4 li/ 










Xjys 


Thr 


Ser 


Vj J- u 


Tyr 


xlxS 


Asp 










X 








Jtrlie 


Trp 


Glu 


Gly 


vjxy 


Pro 














X / U 












Arg 


Trp 


Asp 


Leu 




Arg 










X O 








AT a 




Trp 


Pro 


Trp 


Lys 


Lys 


Lys 










z u u 










S6]r 


Arg 


Thr 


oer 


Pro 


oJ.U 


Arg 










Z X D 








Arg 


Lys 


Asn 


Pro 


Lys 


Leu 


vax 


Asp 










230 








Ala 


Trp 


Lys 


Ser 


Met 


Lys 


Asp 


Thr 










245 








Asp 


Val 


His 


Leu 


Val 


Asp 


His 


Cys 










260 








.Phe 


Arg 


Gly Val 


Leu 


Gin 


Val 


Ser 










275 








Val 


Ala 


He 


He 


Leu 


Met 


Arg 


Lys 



290 



<210> 15 
<211> 249 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 3101617CD1 

<400> 15 



Met Asp Gly 


Lys 


Lys 


Cys 


Ser 


Val 


1 






5 








Phe 


Thr Leu 


Phe 


Thr 


Ser 


Ala 


Gly 








20 








Ala 


Val Glu 


Asp 


Asp 


Lys 


lie 


Leu 








35 






Lys 


Pro Gly 


Val 


Lys 


His 


Ala 


Pro 








50 








Asp 


Pro Pro 


Ala 


Ser 


Cys 


Val 


Phe 








65 








Ala 


Phe Leu 


Ala 


Leu 


Val 


Val 


Ala 



80 



Leu Arg 


Leu 


Trp 


Leu 


Leu 


Leu 




10 










13 


Arg 


Gin 


Lys 


Glu 


Ser 


Gly 


Ser 




25 












He 


Asn 


Arg 


Ser 


Leu 


Glu 


Asn 




40 












Cys 


Ser 


Cys 


Tyr 


His 


Gly 


Val 




55 










50 


Arg Gly 


Gly 


lie 


Ser 


Arg 


Lys 




70 










•7 C 

75 


Lys 


Leu 


Gly 


Tnr 


His 


Tyr 


Gin 




85 










90 


Glu 


Asn 


Asp 


Cys 


Met 


Pne 


Pro 




100 










105 


Phe 


He 


Leu 


Glu 


Val 


He 


Gly 




115 










120 


He 


Asn 


Val 


Arg 


Asp 


Tyr 


Pro 




130 










135 


Ala 


He 


Pro 


Val 


Phe 


Ser 


Phe 




145 










150 


He 


Met 


Tyr 


Pro 


Ala 


Trp 


Thr 




160 










165 


Trp 


Pro 


He 


Tyr 


Pro 


Thr 


Gly 




175 










180 


Glu 


Asp 


Leu 


Val 


Arg 


Ser 


Ala 




190 










195 


Asn 


Ser 


Thr 


Ala 


Tyr 


Phe 


Arg 




205 










210 


Asp 


Pro 


Leu 


He 


Leu 


Leu 


Ser 




220 










225 


Ala 


Glu 


Tyr 


Thr 


Lys 


Asn 


Gin 




235 










240 


Leu Gly 


Lys 


Pro 


Ala 


Ala 


Lys 




250 










255 


Lys 


Tyr 


Lys 


Tyr 


Leu 


Phe 


Asn 




265 










270 


Gly Leu 


Asn 


Thr 


Ser 


Ser 


Cys 




280 










285 


Arg 


Thr 


Tyr 












295 












Trp 


Met 


Pne 


Leu 


Pro 


Leu 


• 

Val 




10 










-15 


Leu 


Trp 


He 


Val 


Tyr 


Phe 


He 




25 










J u 


Pro 


Leu 


Asn 


Ser 


Ala 


Glu 


Arg 




40 










45 


Tyr 


He 


Ser 


He 


Ala 


Gly 


Asp 




55 










60 


Ser 


Gin 


Val 


Met 


Asn 


Met 


Ala 




70 










75 


Val 


Leu 


Arg 


Phe 


He 


Gin 


Leu 



85 90 
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Lys 


Pro 


Lys Val 


Leu 
95 


Asn 


Pro 


Ala 


Leu 


Cys Leu 


Ala 
110 


Ser 


Phe 


Gin 


Leu 


Thr Asn 


Asp Glu 


Glu 








125 






Thr 


Phe 


Gly Phe 


Gly 
140 


Thr 


Leu 


Thr 


Leu 


Lys Val 


Asn 
155 


He 


Lys 


Pro 


Arg 


vciX ±±& 


Leu 
170 


Ser 


Axa 


Tyr 


Phe 


lie" Leu 


Met 
185 


Ala 


Gin 


Val 


Gin 


Trp Gly 


Leu 
200 


Val 


Met 


Phe 


Ala 


Val Glu 


Phe . Arg 


His 








215 






Glu 


Tyr 


Gin Glu 


Asn 
230 


Phe 


Leu 


Ala 


Ser 


Glu Tyr 


Gin 
245 


Thr 


Asp 



Trp 


Leu 


Asn 


xxe ber Gxy Lieu 


vax 






1 nn 




±\JD 




Met 


Thr 


JjcU IjcfU urJ.y AO 11 


XrXitr 






115 




ion 
xz u 




His 


Asn 




Leu 






130 


X3Q 


X ixx 


Cys 


Trp 


Tlo el^r^ a1 a alia 
x±c V3J.ll /vJ.a AJ.a 


Leu 






145 




1 ^n 

X 3U 




Glu Gly 


rViy Aiy VaX '''■^y 


XXc 






160 




1 fit: 
X D D 




He 


Thr 


Ijcu v-y5 Vox VaX 


Leu 






175 




X OU 


ser 


He 


His 


neu lyr ax a Axa 


Arg 






190 






cys 


Phe 


Leu 


oer iyr fne wxy 


inr 






205 




210 


Tyr 


Arg 


Tyr 


Glu He Val Cys 


Ser 






220 




225 


Ser 


Phe 


Ser 


Glu Ser Leu Ser 


Glu 






235 




240 


Gin 


Val 









<210> 16 
<211> 124 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3216178CD1 

<400> 16 



Met 


Gly 


Gly 


Tyr 


Leu 


Lys 


Thr Arg 


1 








5 






Tyr 


Leu 


Cys 


Leu 


Met 

20 


Pro 


Ala Ala 


Leu 


Leu 


Trp 


Leu 


Ser 
35 


Leu 


Gly Val 


Pro 


Gin 


Asn 


Leu 


Cys 
50 


Cys 


Leu Gly 


Gly 


Ser 


Cys 


Tyr 


Cys 
65 


Asp 


Glu Phe 


His 


Pro 


Asp 


His 


Ser 
80 


Val 


Leu Cys 


Lys 


Met 


Val 


Leu 


Gin 
95 


Met 


Val Leu 


Pro 


Ala 


Arg 


Ser 


His 
110 


Leu 


Asp Trp 


Leii 


Gin 


Val 


Leu 









Pro 


Trp 


Thr 


Leu 


Gin 


His 


Phe 




10 










15 


Thr 


Trp 


Leu 


Val 


Leu 


Leu 


Leu 




25 










30 


Lys 


Thr 


Gly 


Ser 


Cys 


Ser 


Gin 




40 










45 


Thr 


Asp 


.His 


His 


Cys 


Lys 


Arg 




55 










60 


Cys 


His 


Val 


Ala 


Pro 


Asp 


Cys 




70 










75 


Asn 


Pro 


Ala 


Ser 


Gin 


Met 


Thr 




85 










90 


Arg 


Met 


Glu 


Asn 


Pro 


Pro 


Ser 




100 










105 


Met 


Gin 


Ser 


Met 


Val 


Ser 


Ser 




115 










120 



<210> 17 
<211> 101 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3406803CD1 

<400> 17 
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1 


L611 


Pro 


Val 


Gly 


Ala Gin Pro 


Arg 


X 

A±Gi 


Arcf 


Leu 


Leu 


5 

His 


Pro Arg Gly 


Pro 










20 






P^TO 


IT lie 


Leu 


Pro 


Trp 


v?j.y ber Lteu 


VjXU 
















Tyc 


Arg 


Ala Cys 


Ser 


Pro Gly Trp 


GXU 










dU 






Pro 


Glu 


Arg 


Glu 


Thr 


Leu Ser Gly 


Gly 










65 






Ala 


Gly 


Ser 


Met 


Val 


Gly Gly Gly 


Glu 










80 






Leu 


Cys 


Val 


Arg 


Leu 


Leu Thr Lys 


Leu 










95 







PCT/USOO/05621 



Ser 


Pro 


Pro 


Trp 


Val 


Leu 


10 










15 


Ala 


Ala 


Thr 


Ser 


Leu 


Val 


25 










30 


Ser 


His 


Thr 


Pro 


Cys 


Pro 


40 










45 


L6U 




Leu 


Ser 


xnr 


13 Via 

fne 


55 










60 


Glu 


Val 


Arg 


Lys 


Arg 


Gly 


70 










75 


Ser 


Thr 


Met 


Thr 


Arg 


Ala 


85 










90 


Arg 


Val 











100 



<210> 18 
<211> 540 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3468066'CDl 

<400> 18 



Meu 


Ala 


Tnr 


Ser 


Gly 


Ala 


Ala 


Ser 


Ala 


Glu 


Leu 


Val 


He 


Gly 


Trp 


1 








5 










10 








15 


Cys 


lie 


pne 


Gly 


Leu 


Leu 


Leu 


Leu 


Ala 


He 


Leu 


9i ^ _ 
Ala 


Phe 


Cys 


Trp 










20 










25 








30 






.veil 


Arg 


Lys 


Tyr 




Ser 


Arg 


Arg 


V7lU 


oer 


ulU 


vai 


vai 










35 










40 










45 


Ser 


Thr 


lie 


Thr 


Ala 


He 


Phe 


Ser 


Leu 


Ala 


He 


Ala 


Leu 


He 


Thr 










50 










55 










60 


Ser 


Ala 


Leu 


Leu 


Pro 


Val 


Asp 


He 


Phe 


Leu 


Val 


Ser 


Tyr 


Met 


Lys 










65 










70 










75 


Asn 


Gin 


Asn 


Gly 


Thr 


Phe 


Lys 


Asp 


Trp 


Ala 


Asn 


Ala 


Asn 


Val 


Ser 










80 










85 










90 


Arg 


Gin 


He 


Glu 


Asp 


Thr 


Val 


Leu 


Tyr 


Gly 


Tyr 


Tyr 


Thr 


Leu 


Tyr 










95 










100 










105 


Ser 


Val 


He 


Leu 


Phe 


Cys 


Val 


Phe 


Phe 


Trp 


He 


Pro 


Phe 


Val 


Tyr 










110 










115 










120 


Phe 


Tyr 


Tyr 


Glu 


Glu 


Lys 


Asp 


Asp 


Asp 


Asp 


Thr 


Ser 


Lys 


Cys 


Thr 










125 










130 










135 


Gin 


He 


Lys 


Thr 


Ala 


Leu 


Lys 


Tyr 


Thr 


Leu 


Gly 


Phe 


Val 


Val 


He 










140 










145 










150 


Cys 


Ala 


Leu 


Leu 


Leu 


Leu 


Val 


Gly 


Ala 


Phe 


Val 


Pro 


Leu 


Asn 


Val 










155 










160 










165 


Pro 


Asn 


Asn 


Lys 


Asn 


Ser 


Thr 


Glu 


Trp 


Glu 


Lys 


Val 


Lys 


Ser 


Leu 










170 










175 








180 


Phe 


Glu 


Glu 


Leu 


Gly 


Ser 


Ser 


His 


Gly 


Leu 


Ala 


Ala 


Leu 


Ser 


Phe 










185 










190 










195 


Ser 


He 


Ser 


Ser 


Leu 


Thr 


Leu 


He 


Gly 


Met 


Leu 


Ala 


Ala 


He 


Thr 










200 










205 










210 


Tyr 


Thr 


Ala 


Tyr 


Gly 


Met 


Ser 


Ala 


Leu 


Pro 


Leu 


Asn 


Leu 


He 


Lys 










215 










220 










225 


Gly 


Thr 


Arg 


Ser 


Ala 


Ala 


Tyr 


Glu 


Arg 


Leu 


Glu 


Asn 


Thr 


Glu 


Asp 










230 










235 










240 


He 


Glu 


Glu 


Val 


Glu 


Gin 


His 


He 


Gin 


Thr 


He 


Lys 


Ser 


Lys 


Ser 










245 










250 








255 


Lys 


Asp 


Gly 


Arg 


Pro 


Leu 


Pro 


Ala 


Arg 


Asp 


Lys 


Arg 


Ala 


Leu 


Lys 










260 










265 










270 


Gin 


Phe 


Glu 


Glu 


Arg 


Leu 


Arg 


Thr 


Leu 


Lys 


Lys 


Arg 


Glu 


Arg 


His 










275 










280 










285 


Leu 


Glu 


Phe 


He 


Glu 


Asn 


Ser 


Trp 


Trp 


Thr 


Lys 


Phe 


Cys 


Gly 


Ala 
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290 



Leu 


Arg 


Pro 


Leu 


Lys 
305 


lie 


Val 


Ala 


Leu 


Leu 


Pne 


Val 
320 


He 


Ser 


Ala 


Leu 


His 


Ser 


Ala 
335 


Gly 


He 


Ala 


Asn 


Leu 


Ser 


Asn 
350 


Pro 


Leu 


Tiir 


Val 


Pne 


Pro 


Leu 
365 


Asp 


Tyr 


Tyr 


Pne 


lie 


Pne 


Thr 
380 


Ser 


Met 


Trp 


Pne 


Phe 


Trp 


He 
395 


Arg 


Leu 


Arg 


Pro 


Gin 


Ala 


Leu 
410 


Leu 


Pne 


Val 


Leu 


His 


Thr 


Ser 
425 


Tyr 


Met 


Val 


Met 


Tyr Gly 


Ser 


Gin 


Asn 










440 






Ser 


Asp 


Asn 


His 


Lys 
455 


Gly 


Asn 


Cys 


Asp 


Ala 


Glu 


Ala 

Aid 


Pro 


Glu 


Tyr 


Leu 


Phe 


Leu 


His 
485 


Lys 


Phe 


Phe 


Gly 


Asn 


Trp 


Ala 
500 


Phe 


Leu 


Val 


Ser 


Cys 


Cys 


Lys 
515 


Gly 


Lys 


Glu 


Asp 


Ser Asp 


He 


Ser 


Asp 



530 





295 






•a f\f\ 
300 


Trp Gly 


He 


Phe 


Phe 


He Leu Val 




310 






315 


Leu Phe 


Leu 


Ser 


Asn 


Leu Asp Lys 




325 






"5 O A 

3 30 


Asp Ser Gly 


Phe 


He 


xi.e xrne - Osiy 




340 






345 


Asn Met 


Leu 


Leu 


Pro 


T All T All T 

ijeu j-ieu k^xn 




o c c 

355 






OCA 

3 50 


He Leu 


He 


Thr 


He 


lie lie Met 




370 






1 *T e 

375 


Ala Gly 


He 


Arg Asn 


He Gly He 




385 






O A A 

390 


Tyr Lys 


He 


Arg Arg 


Gly Arg Thr 




400 






405 


Leu Cys 


Met 


He 


Leu 


Leu Leu He 




415 






420 


He Tyr 


Ser 


Leu 


Ala 


Pro Gin Tyr 




430 






435 


Tyr Leu 


He 


Glu 


Thr 


Asn He Thr 




445 






450 


Ser Thr 


Leu 


Ser 


Val 


Pro Lys Arg 




460 






465 


Asp Gin Cys 


Thr 


Val 


Thr Arg Thr 




475 






480 


Trp Phe 


Phe 


Ser 


Ala 


Ala Tyr Tyr 




490 






495 


Gly Val 


Phe 


Leu 


He 


Gly Leu He 




505 






510 


Lys Ser 


val 


He 


Glu 


Gly Val Asp 




520 






525 


Asp Glu 


Pro 


Ser 


Val 


Tyr Ser Ala 




535 






540 



<210> 19 
<211> 108 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 3592862CD1 

<400> 19 



Met 


Thr 


Pro 


Ser Arg 


Leu Pro Trp Leu Leu 


Ser Trp 


Val Ser Ala 


1 






5 


10 




15 


Thr 


Ala 


Trp Arg Ala 


Ala Arg Ser Pro Leu 


Leu Cys 


His Ser Leu 








20 


25 


30 


Arg 


Lys 


Thr 


Ser Ser 


Ser Gin Gly Gly Lys 


Ser Glu 


Leu Val Lys 








35 


40 




45 


Gin 


Ser 


Leu 


Lys Lys 


Pro Lys Leu Pro Glu 


Gly Arg 


Phe Asp Ala 








50 


55 




60 


Pro 


Glu 


Asp 


Ser His 


Leu Glu Lys Glu Pro 


Leu Glu 


Lys Phe Pro 








65 


70 




75 


Asp 


Asp 


Val 


Asn Pro 


Val Thr Lys Glu Lys 


Gly Gly 


Pro Arg Gly 








80 


85 




90 


Pro 


Glu 


Pro 


Thr Arg 


Tyr Gly Asp Trp Glu 


Arg Lys 


Gly Arg Cys 








95 


100 




105 


lie 


Asp 


Phe 











<210> 20 
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<211> 114 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 3669422CD1 

<400> 20 



Met 


Ser 


Ser 


Ser 


Ser 


Ser Arg 


Cys Leu 


Ser 


Pro Ser 


Pro Giy Met 


1 








5 






10 




15 


Ser 


Leu 


Trp 


Ser 


Cys 


Leu Leu 


Phe Leu 


Cys 


Thr Pro 


Ser Pro Thr 










20 






25 




30 


Thr 


Thr 


Ser 


Pro 


Ser 


Pro Asp 


Pro Ser 


Gin 


Val Ser 


Thr Leu Pro 










35 






40 




45 


Thr 


Pro 


Ser 


Pro 


Gin 


Arg Glu 


Gly Leu 


Lys Gin Gly Gin Trp Arg 










50 






55 




60 


Lys 


Thr 


Gly Pro 


Ser 


Ser Thr 


His Pro 


His 


Thr Pro 


Ser Ser Arg 










65 






70 




75 


Pro 


Pro 


Ser 


Pro 


Ser 


Ser Leu 


Pro Leu 


Thr 


Trp Lys 


Leu Leu Gin 










80 






85 




90 


Pro 


He 


Pro 


Ser 


His 


Ser Leu 


Pro His 


Pro 


Pro Lys 


He His Thr 










95 






100 




105 


Gly 


Pro 


Ser 


Leu 


Ala 


Glu Cys 


Gly His 









110 



<210> 21 
<211> 114 . 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 3688740CD1 

<400> 21 



Met 


Arg 


Gly Glu 


His 


Asn 


Ser 


Thr Ser 


Tyr 


Asp Ser Ala Val 


He 


1 






5 








10 




15 


Tyr 


Arg 


Gly Phe 


Trp 


Ala 


Val 


Leu Met 


Leu 


Leu Gly Val Val Ala 








20 








25 




30 


Val 


val 


He Ala 


Ser 


Phe 


Leu 


He He 


Cys 


Ala Ala Pro Phe 


Ala 








35 








40 




45 


Ser 


His 


Phe Leu 


Tyr 


Lys 


Ala 


Gly Gly 


Gly 


Ser Tyr He Ala 


Ala 








50 








55 




60 


Asp 


Gly 


He Ser 


Ser 


Leu 


Cys 


Tyr Ser 


Ser 


Leu Ser Lys Ser 


Leu 








65 








70 




75 


Leu 


Ser 


Gin Pro 


Leu 


Arg 


Glu 


Thr Ser 


Ser 


Ala He Asn Asp 


He 








80 








85 




90 


Ser 


Leu 


Leu Gin 


Ala 


Leu 


Met 


Pro Leu 


Leu Gly Trp Thr Ser 


His 








95 








100 




105 


Trp 


Thr 


Cys He 


Thr 


Val 


Gly 


Leu Tyr 









110 



<210> 22 

<211> 287 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3742589CD1 
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<400> 22 
Met Glu Leu 
1 

Gin Thr His 


lie 


Pne 


Ser 


Gly 


Pro 


ser 


nec 


GIU 


Ala 


lie 


Gly 


Asp 




Asn 


Lys 


Gin 


Val 


Pro 


Lys 


Glu 


Glu 


Asp 


Glu 


Ala 


Vai 


Leu 


Leu 


Trp 


Val 


Leu 


Met 


Leu 


Val 


Pro 


Asn 


Gin 


Glu 


Leu 


Lys 


Ala 


Glu 


Asp 


Pro 


Lys 


Lys 


Lys 


Gly 


Glu 


Met 


Lys 


Ala 


Phe 


His 





Glu 


Arg 


lie 




D 




Leu 


Pro 


Glu 




20 




Tyr 


vai 


Leu 




35 




Glu 


Glu 


Asn 




C f\ 

50 




Tyr 


Val 


Pro 




o5 




Met 


Met 


Gin 




oU 




Glu 


Asn 


Leu 




95 




lie 


Ser 


Pro 




110 




Thr 


Arg 


Ser 




lZ3 




Thr 


Gly 


Ala 




140 




Glu 


Val 


Phe 




155 




Ala 


Lys 


Ala 




170 




Glu 


Gly 


Lys 




185 




Asp 


Leu 


Pro 




200 




Ser 


Phe 


lie 




215 




Gin 


Lys 


lie 




230 




Leu 


lie 


Arg 




245 




Arg 


Phe 


Lys 




260 




Thr 


Tyr 


He 




275 





Val 


Ser 


Ala 


Ala 


Asp 


Leu 


Gly 


Val 


Leu 


Phe 


Asp 


Met 


Gly 


Phe 


Ala 


Lys 


Leu 


Ser 


Gin 


Pro 


Gin 


Glu 


Pro 


Leu 


Ser 


Ala 


Ala 


Glu 


Glu 


Glu 


Pro 


Thr 


Cys 


Arg 


Gly 


Asp 


Glu 


Glu 


Gly 


Arg 


Arg 


Leu 


Leu 


Gin 


Lys 


His 


Arg 


Pro 


Tyr 


He 


Asp 


Asp 


Val 


Arg 


Asn 


Leu 


Lys 



Ala 

Aia 


Leu 


Leu 


lU 






Ser 


vjiy 


Leu 


OK 


■ 




V7XU 


Asp 


Leu 








r* 1 11 
VjIU 


Aia 


jrne 


DD 






nlS 


lie 


Pro 


/U 






vjiy 


\3lXi 


Leu 


Q EI 






Ser 


Ser 


vaiy 


lUU 






Gin 


Arg 


Pro 


lie 
113 






Ala 


Ala 


Ala 


130 






Leu 


Leu 


Pro 


14b 






Ser 


vai 


Glu 


1 C A 
1 OU 






Leu 


Glu 


Glu 


175 






Pro 


Ala 


Ala 


190 






Arg 


Gly 


Pro 


205 






Tyr 


Met 


Met 


n A 






Met 


Ala 


Pro 


235 






Asn 


Gin 


Val 


250 






Asn 


Pro 


Glu 


265 






Pro 


Ala 


Arg 


280 







Ala 


Fne 


vai 








Asp 


VjrXU 


vai 








Vaiy 


Fro 


Ser 






A K 


inr 


vjIU 


Ma*- 

ciec 








Arg 


Gly 


Thr 






T C 

75 


Ser 


ASp 


Ala 






OA 


vai 


Gin 


Gly 






1 AC 

105 


GIU 


Met 


Leu 






120 


Asp 


Thr 


Gin 








Gly 


Val 


Asp 






1 C A 

IDU 


Gin 


Ala 


Gin 






ICC 

165 


Ala 


Vai 


Gin 






180 


Trp 


Glu 


Gly 






195 


Gin 


Lys 


Asp 






210 


Val 


Asp 


Ser 






o o c 

225 


Lys 


Glu 


Ala 






240 


Val 


Ser 


Thr 






255 


Ala 


Glu 


Glu 






270 


Lys 


Tyr 


Arg 






285 



<210> 23 

<211> 854 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 078811CB1 

<400> 23 

attttgcctc gtggacccaa aggtagcaat ttgaaacatg aggagtacga ttctactgtt 60 
ttgtcttcta ggatcaactc ggtcattacc agtctttcct tctttaagtc tgataccatt 120 
aacacagatg ctcacactgg ggccagatct gcatctgtta aatcctgctg caggaatgac 180 
acctggtacc cagacccacc cattgaccct gggagggttg aatgtacaac agcaactgca 240 
cccacatgtg ttaccaattt ttgtcacaca acttggagcc ccagggcact atcctaagct 300 
cagaggaatt gccacaaatc ttcacgagcc tcatcatcca ttccttgttc cccgggaggc 360 
atccttgccc accagtcagg caggggctaa tccagatgtc caggatggaa gccttccagc 42 0 
aggaggagca ggtgtaaatc ctgccaccca gggaacccca gcaggccgcc tcccaactcc 480 
cagtggcaca gatgacgact ttgcagtgac cacccctgca ggcatccaaa ggagcacaca 540 
tgccatcgag gaagccacca cagaatcagc aaatggaatt cagtaagctg tttcaaattt 600 
tttcaactaa gctgcctcga atttggtgat acatgtgaat ctttatcatt gattatatta 660 
tggaatagat tgagacacat tggatagtct tagaagaaat taattcttaa tttacctgaa 720 
aatattcttg aaatttcaga aaatatgttc tatgtagaga atcccaactt ttaaaaacaa 780 
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taattcaatg gataaatctg tctttgaaat ataacattat gctgcctgga tgatatgcat 840 
attaaaacga atta 854 



<210> 24 

<211> 1804 

<212> DNA . 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 371156CB1 



<400> 24 

gtgataggca gctttccttc ttttcaacag 
gctgaggttt tgtgctcact gaaagggctg 
ggtatgtgaa gatgcaccgt cttttcaaat 
ggggcctgcc cttctctgct gtgtcctttc 
cccactgggg catataaaat ctctgctgta 
accaggaccc actggggcat ataaaaaagt 
gttacttagg aaacagactt cgcatttcaa 
tctggcttct agacaaattc ctgatagaac 
cccatcccct gtttagggat agagttgata 
cctgaatttt tttaattgac ttttgagctt 
tgctgttgtg gataggaaag acttaaccta 
aatctaacaa tatgaagggc tcttatgagt 
acgcacagga acgaaatacc tcccagaaac 
tatttttaaa aagtatacag atcaaagcaa 
gcaaatattt tttaaggcag tattaagtgc 
cacatggcta ctgggaatat aaatttcgct 
tggcaaaacc ttaagattgt gtactggggc 
gaggaaatta tccgagatcc ccacaaactg 
catacacaaig aaaaacagag aaaagcctga 
ttacggtgtg tctgcatgag gcttttatga 
cagtggctca tgcctgtaat cctaacactt 
ctcaggagtt tgagaccagc ctgggcaaca 
aaaattagcc gggcgtcgca gcatgcgcct 
ggagaattga ttgaacccgg gaggcagagg 
tccagcctgg gcgacagagc aagattccgt 
gggcaaaggg agagaatcat aacatctgat 
actatataag gatggtccca gctgtgtcaa 
aaaattaaat agaggtgaac acaattattt 
actaagactt tctagaattt tacttattca 
atatactttt gtacitcagaa aatattaaat 
aaaa 



tgatacctac gaaaatcaaa ataaatgcaa 60 
tcaaccccag aaggccgaca caaaaaaaat 120 

ggcctgggag agtcaaatgg cctgggagag 180 
ggcttcccag ttgagctccc aagaccagga 240 
tcctttcggc ttcccagttg agctcccaag 300 
caaaaatcaa aatcaaacaa caagttctga 360 
tcagagaggc cacagagcaa ggtctaaact 420 
atttaaatgt gggaagtggc ttccccaggt 480 
tcatttttat aggtgccatg tatgcctctg 540 
ttgagattgc acgagggaga acaaggcctt 600 
aaattaaacc agcaagaaag cattagtaaa 660 
catttttttc aaaagatgaa aactccagaa 720 
atgaagcaat catcgaagac tcactggtaa 780 
aaagaagcca tgtgtaacaa agagaaatgt 840 
aagaggagta acatgaaata aacattcttt 900 
ccagaaaggc cgtagcagtt tgacgatagg 960 
ccagaatttt tatttctagg aatgtatcct 1020 
caatgtttag gaattgtcct tatagcattg 1080 
tccctgtcag tggaaaaggg gttcaatgaa 1140 
cattaaaaat tgttgaacaa cggccaggca 1200 
tgggaggcca aggtgggaag attgcctgag 1260 
cggtgaaacc ccgtctctac taaaatacaa 1320 
gtagtcccag ctgctcagga ggctgaggca 1380 
ttgcactgag ctgagattaa gccaccgcac 1440 
tcccaagaaa aaaaaattgt tcaacaataa 1500 
taaacagaaa aagcaagatt tttaaaacta 1560 
aaggaagctt gtttgtaata cgtgtgcata 1620 
taaggcagtt aaattatctc tgtattgtga 1680 
ttctgtactt aaattttttc taatgaacac 1740 
gcatgtattt ttcaaaatca aaaaaaaaaa 1800 

1804 



<210> 25 

<211> 2663 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 584050CB1 

<400> 25^ 

ggagaaagga tggccggcct ggcggcgcgg 
gcgagcggct cccagggcga ccgtgagccg 
gagcagaact gctctggggg cgctctgaat 
agtctagcag gctggacctg tcgggacgac 
gggctctacc tccaggaagg tcacaaagtg 
cggttcctgt tctttcaaga gccggcatcg 
agcctggtga tgctctgccg ctaccgcacc 
acctgtgtgg ccttcgcctg ggtgtccctc 
accagggaca ctgacctcac agagaaaatg 
cactcaatct acctgtgctg cgtcaggacc 



ttggtcctgc tagctggggc agcggcgctg 60 
gtgtaccgcg actgcgtact gcagtgcgaa 120 
cacttccgct cccgccagcc aatctacatg 180 
tgtaagtatg agtgtatgtg ggtcaccgtt 240 
cctcagttcc atggcaagtg gcccttctcc 300 
gccgtggcct cgtttctcaa tggcctggcc 360 
ttcgtgccag cctcctcccc catgtaccac 420 
aatgcatggt tctggtccac agtcttccac 480 
gactacttct gtgcctccac tgtcatccta 540 
gtggggctgc agcacccagc tgtggtcagt 600 
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PCTAJSOQ/05621 



gccttccggg ctctcctgct gctcatgctg accgtgcacg tctcctacct gagcctcatc 660 
cgcttcgact atggctacaa cctggtggcc aacgtggcta ttggcctggt caacgtggtg 720 
tggtggctgg cctggtgcct gtggaaccag cggcggctgc ctcacgtgcg caagtgcgtg 780 
gtggtggtct tgctgctgca ggggctgtcc ctgctcgagc tgcttgactt cccaccgctc 840 
ttctgggtcc tggatgccca tgccatctgg cacatcagca ccatccctgt ccacgtcctc -900 
tttttcagct ttctggaaga tgacagcctg tacctgctga aggaatcaga ggacaagttc 960 
aagctggact gaagaccttg gagcgagtct gccccagtgg ggatcctgcc cccgccctgc 1020 
tggcctc.cct tctcccctca acccttgaga tgattttctc ttttcaactt cttgaacttg 1080 
gacatgaagg atgtgggccc agaatcatgt ggccagccca ccccctgttg gccctcacca 1140 
gccttggagt ctgttctagg gaaggcctcc cagcatctgg gactcgagag tgggcagccc 1200 
ctctacctcc tggagctgaa ctggggtgga actgagtgtg ttcttagctc taccgggagg 1260 
acagctgcct gtttcctccc caccagcctc ctccccacat ccccagctgc ctggctgggt 1320 
cctgaagccc tctgtctacc tgggagacca gggaccacag gccttaggga tacagggggt 1380 
ccccttctgt taccaccccc caccctcctc caggacacca ctaggtggtg ctggatgctt 1440 
gttctttggc cagccaaggt tcacggcgat tctccccatg ggatcttgag ggaccaagct 1500 
gctgggattg ggaaggagtt tcaccctgac cgttgcccta gccaggttcc caggaggcct 1560 
caccatactc cctttcaggg ccagggctcc agcaagccca gggcaaggat cctgtgctgc 1620 
tgtctggttg agagcctgcc accgtgtgtc gggagtgtgg gccaggctga gtgcataggt 1680 
gacagggccg tgagcatggg cctgggtgtg tgtgagctca ggcctaggtg cgcagtgtgg 1740 
agacgggtgt tgtcggggaa gaggtgtggc ttcaaagtgt gtgtgtgcag ggggtgggtg 1800 
tgttagcgtg ggttagggga acgtgtgtgc gcgtgctggt gggcatgtga gatgagtgac 1860 
tgccggtgaa tgtgtccaca gttgagaggt tggagcagga tgagggaatc ctgtcaccat 1920 
caataatcac ttgtggagcg ccagctctgc ccaagacgcc acctgggcgg acagccagga 1980 
g'ctctccatg gccaggctgc ctgtgtgcat gttccctgtc tggtgcccct ttgcccgcct 2040 
cctgcaaacc tcacagggtc cccacacaac agtgccctcc agaagcagcc cctcggaggc 2100 
agaggaagga aaatggggat ggctggggct ctctccatcc tccttttctc cttgccttcg 2160 
catggctggc cttcccctcc aaaacctcca ttcccctgct gccagcccct ttgccatagc 2220 
ctgattttgg ggaggaggaa ggggcgattt gagggagaag gggagaaagc ttatggctgg 2280 
gtctggtttc ttcccttccc agagggtctt actgttccag ggtggcccca gggcaggcag 2340 
gggccacact atgcctgcgc cctggtaaag gtgacccctg ccatttacca gcagccctgg 2400 
catgttcctg ccccacagga atagaatgga gggagctcca gaaactttcc atcccaaagg 2460 
cagtctccgt ggttgaagca gactggattt ttgctctgcc cctgacccct tgtccctctt 2520 
tgagggaggg gagctatgct aggactccaa cctcagggac tcgggtggcc tgcgctagct 2580 
tcttttgata ctgaaaactt ttaaggtggg agggtggcaa gggatgtgct taataaatca 2640 
attccaagcc tcaaaaaaaa aaa 2663 



<210> 26 

<211> 769 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<223> Incyte ID No: 863808CB1 

<400> 26 

gcgacgccga cgcaaggctg ctgctatggg gccgggcggc cgtgtggcgc ggctgctcgc 60 
cccactaatg tggcgcaggg cggtttcctc ggtggcgggg tccgcggttg gagccgagcc 120 
cgggcttcgg ctgctggccg tgcagcggct tcccgtagga gcagcgttct gccgggcttg 180 
ccagacccca aactttgtcc gcggcctgca cagcgagcct gggctggagg agcgggcgga 240 
ggggacggtc aacgagggac gcccagaatc ggacgcggca gatcatactg gtcccaagtt 300 
tgacatcgat atgatggttt cacttctgag gcaagaaaat gcaagagaca tttgtgtgat 360 
ccaggttcct ccagaaatga gatatacaga ttactttgtg attgttagtg gaacttctac 420 
ccgacactta catgccatgg ccttctacgt tgtgaaaatg tacaaacacc tgaaatgtaa 480 
acgtgaccct catgttaaga tagaagggaa ggacactgat gactggctgt gcgtggattt 540 
tggcagcatg gtgattcatt tgatgcttcc agaaaccaga gaaatctatg aattagagaa 600 
attatggacc ctacgttctt atgatgacca gttagctcag atagcacctg^ agacagtacc 660 
tgaagacttc attcttggaa tagaagatga tacttcatct gtgactccag' tggagttaaa 720 
atgtgaataa aatattttat gcactgcgtt agtcaaaaaa aaaaaaaaa 769 



<210> 27 
<211> 1257 
<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_featurG 

<223> Incyte ID No: 978433CB1 

<400> 27 

gaggcgcgcg ggtgaaaggc gcattgatgc 
cagacgctga ccacgttcct ctcctcggtc 
agccgggagc catgcgaccc cagggccccg 
tgctgctcct gctgctgcag ctgcccgcgc 
agcaaaaggc gcagctccgg cagagggagg 
aagggccagc aggagtgcct ggtcgagacg 
cacctgggat cccaggtcgg gatggattca 
gctttgagga gtcctggaca cccaactaca 
gcatagatct tgggaaaatt gcggagtgta 
taagagtttt gttcagtggc tcacttcggc 
ggtatttcac attcaatgga gctgaatgtt 
atttggacca aggaagccct gaaatgaatt 
tggaaggact ttgtgaagga attggtgctg 
cttgttcaga ttacccaaaa ggagatgctt 
ttattgaaga actaccaaaa taaatgcttt 
tgccttggaa tggttcactt aaatgacatt 
aaagcaaagc taaatatgtt tacagaccaa 
cattattcat tttgcttcaa tcaaaagtgg 
ctttcttcat agtcacattc tctcaaccta 
ttttctctta gtatagcatt tttaaaaaaa 
taaatgttaa gaattttttt tatatctgtt 



agcctgcggc ggcctcggag cgcggcggag 60 
tcctccgcct ccagctccgc gctgcccggc 120 
ccgcctcccc gcagcggctc cgcggcctcc 180 
cgtcgagcgc ctctgagatc cccaagggga 240 
tggtggacct gtataatgga atgtgcttac 300 
ggagccctgg ggccaatggc attccgggta 360 
aaggagaaaa gggggaatgt ctgagggaaa 420 
agcagtgttc atggagttca ttgaattatg 480 
catttacaaa gatgcgttca aatagtgctc 540 
taaaatgcag aaatgcatgc tgtcagcgtt 600 
caggacctct tcccattgaa gctataattt 660 
caacaattaa tattcatcgc acttcttctg 720 
gattagtgga tgttgctatc tgggttggca 780 
ctactggatg gaattcagtt tctcgcatca 840 
aattttcatt tgctacctct ttttttatta 900 
ttaaataagt ttatgtatac atctgaatga 960 
agtgtgattt cacactgttt ttaaatctag 1020 
tttcaatatt ttttttagtt ggttagaata 1080 
taatttggaa tattgttgtg gtcttttgtt 1140 
tataaaagct accaatcttt gtacaatttg 1200 
aaataaaaat tatttccaac aacctta 1257 



<210> 28 
<211> 2560 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 16553 69CB1 

<400> 28 

ttccagtgaa gagcaagtgc tgcccgaccc aggaccctgt gccaggctag cagccctcca 60 
gctccctcca gagaggaaac ctctgtctgg ctgagggtgg gactagctgg gatgtctcac 120 
tccagttgct caggttcacc caggaagctc ctccgtggag tggccagcct gattctagcc 180 
ctgtcctctc tggcagcaca tgccacacct gcctgggcct tctgctccct gatgcttgat 240 
gagcccctgc ctcctcaatg tttctcaaag acagaccccc ctgaggccag cttgaatgtg 3 00 
aagactgctg aagtcagctg gcttcacttg agctgcagaa aaggtggctg ggatggccca 3 60 
ggtgcaccca gaggccccag ccctttggct gcctttgggt tgtgacttgg gttgtctctg 420 
aggccctgcc agagctgggc ctgcgggtgg tgggcggtcc gacctcgggc agtcagtgct 480 
ccgcagcctc agcactgcat cccagaccca gtgtcctcag agggaagagc cagcctccct 540 
gcctcatgga accaggagtc ccaaaaagtc aggagcctgg aggctctgaa aggagcaggg 600 
attccatagt gcgtgaagct gaaataggcg ccctcctggg gagcccccag caaaactgtt 660 
tttcataccc actcccagaa ctgccccgct ccagctccag cgccagcgcc agctggttgc 720 
caggcgtcat tggagaggcc tggctgcccc aggggcagca gggagtggtg gacctgtatg 780 
ggctggcagg aggccattgg ccatgctgac aagtgtcacc tgccttccta gcctggagcc 840 
acccctcagg tggcctgctt gcacctccta tccggaggta gcctgcccca cctgtaggca 900 
gagggggctc ttgcttgagg cctgcacagg aagcaagtat agccccggtg ccccagagtg 960 
ggttccactt agccctggcg agatggcctg tcctgagatc tctgctccca gaccccacca 1020 
tctggggagc acagtcctta ggctgcctgg tccaggaagg gggtgcggct ctgtcaggaa 1080 
acctggactc tcaaggccca ccagcctctc cgtgagtgtt agaaatcaca gatacagtat 1140 
atacttaatt acactaaatt attgctggga ttccttataa gcactaatta tacctgatta 1200 
taggttaaaa tatttatttt gtcaaaatat tttcttggga atgtgtttaa ccctttctgc 1260 
gttcattgtt gctgagatgt gaaaactaac cattccctcc tgcctacctt tttggccact 1320 
gggcggcaga gaatggcgct atgtgcagtt gggcccctgg caeca tgggc ctttggcctg 1380 
cctgctgcag agtagccctg cctgggcagt ctccaggcac tgagcaggcc atctgtggcc 1440 
aggctgagag aatgactggc tcgcttacca gcgtgcatgg gacaaggagc tttggagcct 1500 
caaggggttg ttgctggcct gggctagagg gaaaggtgac catccgtctg tcctcctgtc 1560 
tttctattag cgcctccatg tgagtgatgg tgccttggtt cactagcctt cccccaccac 1620 
cccaccatgc cacctggtgg tcttggggcc tgtgctgtca ctccagcccc tggggaggag 1680 
aggacccagc ccggagagtt ggggcaaggg ctccacatgg cccaagggca acagatgctc 1740 
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gcagggcagc tgctgccgat gctcacgctc 
accctgggcc cccgcagaca cgcatctcta 
tggggtaggc catgggccca cctggggcca 
agtgtggagg agcacttgct tgcagcctgg 
ctgggagctt ctgcactgtc ggctttgggg 
ttccccaccc agagagaagt gtttccaccc 
ctcgcctccc ccagtgcccg tcaccagccc 
ccctgcggcc accagccata gggagcagcc 
tgtccatacc tccaggctct cccggagagg 
cttaattgtg aggattctca ggattgttgg 
tcgcgtttgc tgtccactcg tcctagaagt 
gtgggcagag ctggttctgg agggtgggtc 
ccccactggc cctccttcca gataccctct 
ccacaataaa taaataattg aacaaattaa 



ctgcccccct ccttcccgct gccacacccc 1800 
actcagttgg gcccagcctt ctggatggct 1860 
ggccagcccc tggggcagct ctggaagagc 1920 
cttcagcctc tggcactgct ggagtggtcc 1980 
acgtctcacc cacttgggtt acagtaggcc 2040 
cagagacatt gtctgtcagc ccctgaagtg 2100 
ttcctatctg tggggtccaa gtcaggcttc 2160 
atcagccccc gagtcagaac tgcttctgtc 2220 
gggacggata tttatttcct aaagtttgca 2280 
gggctactga aaagaggaat gtgttgaatg 2340 
ttagtgtttt tgtcactgtc atgtgtttct 2400 
agtgcacccg aggctcagag catccatcca 2460 
ctctaattgg gttcttgcat gtaaaatact 2520 
aaaaaaaaaa 2560 



<210> 29 

<211> 614 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 1703244CB1 

<400> 29 

gtgcaagagg aacaaagaaa gggactcctg 
tatctcatcc ttgctggctt cttcactctg 
acagatgcct gctttgtcta tatctaccag 
taccctaagg tgcagatgct gatgtacatg 
gcctatgctc tcaccttccc tggttgctcc 
ggaggcatcg gccaggcaca gttctcgcac 
ttcacctacc gtgtgcctga ggacacctgg 
gcgctgggcc cccacctgct ggcctaccgt 
ccaccaccct ccgaccccct agccctccac 
aggacccagg actctgttta cgtgcccagt 
gttttagaga cagg 



cagcgtccgg ctgacctggc ccttgtcata 60 
ttccggggcc tggtggtgct tgattgcccc 120 
tatgagccat acctgcggga ccctgtggcc 180 
ttttatgtcc tgcctttctg cggcctggct 240 
tggcttccag actgggcctt ggtgtttgct 300 
atgggggctt ccatgcacct gcgcacaccc 360 
ggctgcttct tcgtgtgcaa tctgctgtat 420 
tgccttcagt ggcccgcatt cttccaccag 480 
aagaagcagc attgagagag ctgtggactc 540 
cagccctacc tggggaagcg ggggttgggt 600 

614 



<210> 30 

<211> 1936 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1730819CB1 

<400> 30 

gactacgggc tcacagccgt cccttcgctg 
cgctggggca acccggctgc tcctgctctt 
ccggggcagc ggctgccggg ccgggactgg 
gggcgaggcc tgtggcacgg tggggctgct 
tgccaacttc cggaagcggg gctcactgct 
gtcacagcgg cagctcagcg aggaggagcg 
tggcctgtac cgggtccgga tcccaaggcg 
tggctatgtc tcctcctttg tccctgcgtg 
gctgaccctg cacgtggatg tggccggcaa 
cgggggctgc cggggccatg aggtggagga 
gcagctgcag ccgcccacca cagccccagg 
ggagatggaa caggcccaga aggccaagaa 
atactggatg tacatcattc ccgtcgtcct 
cgggggccag ggtgggggtg ggggttgtgg 
caggctggtc agcgtcccgt cttgcacacc 
gtgtcctcag ccatcccaag aagggtttgc 
ccacctgggc cagccccttg tcctctgcct 
cctttggcac agcagccggt gtctcctgcg 



gtgggaagaa gccgagatgg cggcagccag 60 
gctgatggcg gtagcagcgc ccagtcgagc 120 
tgcgcgaggg gctggggcgg aaggtcgaga 180 
gctggagcac tcatttgaga tcgatgacag 240 
ctggaaccag caggatggta ccttgtccct 300 
gggccgactc cgggatgtgg cagccctgaa 360 
acccggggcc ctggatggcc tggaagctgg 420 
ctccctggtg gagtcgcacc tgtcggacca 480 
cgtggtgggc gtgtcggtgg tgacgcaccc 540 
cgtggacctg gagctgttca acacctcggt 600 
ccctgagacg gcggccttca ttgagcgcct 660 
cccccaggag cagaagtcct tcttcgccaa 720 
gttcctcatg atgtcaggag cgccagacac 780 
tggtggtggg ggtagtggcc ggtgagggcc 840 
caggggcctc cctttctgct ggagtcccct 900 
tggtccctcc tttccccccg tcccacgagg 960 
tctgctggca gaggagcagc tggactgggg 1020 
cccgcctccc cdatggcccc atgcagcccc 1080 
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aggggcttcc cccctgccca tggagtagag cccgagatcc tggccactat gccagttctg 1140 
acctcgcatc cccctacccc gagcccatgc agtctgggaa catgccgcct tctctccagc 1200 
ctctgtgcct ttgttccagg tggtctcacc ctcctgtccc tggctgggct aggtggtcct 12 60 
gtccaggctc ctgcagcgcc cccctcactt tgacactgga ctaggatgca gcctcccttc 1320 
tgtgtcccct tgagggtacc ctgggtcccc tcatcagggg cagaggcatg aaagagtcgg 1380 
ggctggatgg ccgggggctt ctgggcccga cgcctagtgc agcccctggg gtcgtggttt 1440 
gacatttgtc tgcctggtgc aaacaaggaa tccttgcctt taaggtgaca ggccctccac 1500 
aggcttccag acttgaagga aaaggtttaa gaaagaaaac aaaaccaaca gttagtggag 1560 
tcaaagccca gacactgtaa atagaacccc ctccaccacc ccccgccgcc cagcatccta 1620 
cctggactgc ggtgctacga gggcctgcgg gcctttgctg tgtgccaccc tccctgtaag 1680 
tctatttaaa aacatcgacg atacattgaa atgtgtgaac gttttgaaaa gctacagctt 1740 
ccagcagcca aaagcaactg ttgttttggc aagacggtcc tgatgtacaa gcttgattga 1800 
aattcactgc tcacttgata cgttattcag aaacccaagg aatggctgtc cccatcctca 1860 
tgtggctgtg tggagctcag ctgtgttgtg tggcagttta ttaaactgtc ccccagatcg 1920 
acacgcaaaa aaaaaa 1936 



<210> 31 

<211> 1958 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1757161CB1 

<400> 31 

gccgcgcctt cagctacggc ccgagcgagc ccgccgccgc cgggcccggc cacagcctgc 60 

agcggagccc acgagaggca gcgccatggc ggagcagacc tactcgtggg cctattccct 120 

ggtggattcc agtcaagtgt ctacatttct gatttccatt cttcttatag tctatggtag 180 

tttcaggtcc cttaatatgg actttgaaaa tcaagataag gagaaagaca gtaatagttc 240 

ttctgggtct ttcaatggca acagcaccaa taatagcatc caaacaattg actctaccca 300 

ggctctgttc cttccaattg gagcatctgt ctctctttta gtaatgttct tcttctttga 360 

ctcagttcaa gtagttttta caatatgtac agcagttctt gcaacgatag cttttgcttt 420 

tcttctcctc ccgatgtgcc agtatttaac aagaccctgc tcacctcaga acaagatttc 480 

ctttggttgc tgtggacgtt tcactgctgc tgagttgctg tcattctctc tgtctgtcat 540 

gctcgtcctc atctgggttc tcactggcca ttggcttctc atggatgcac tggccatggg 600 

cctctgtgtc gccatgatcg cctttgtccg cctgccgagc ctcaaggtct cctgcctgct 660 

tctctcaggg cttctcatct atgatgtctt ttgggtattt ttctcagcct acatcttcaa 720 

tagcaacgtc atggtgaagg tggccactca gccggctgac aatccccttg acgttctatc 780 

ccggaagctc cacctggggc ccaatgttgg gcgtgatgtt cctcgcctgt ctctgcctgg 840 

aaaactggtc ttcccaagct ccactggcag ccacttctcc atgttgggca tcggagacat 900 

cgttatgcct ggtctcctac tatgctttgt ccttcgctat gacaactaca aaaagcaagc 960 

cagtggggac tcctgtgggg cccctggacc tgccaacatc tccgggcgca tgcagaaggt 1020 

ctcctacttt cactgcaccc tcatcggata ctttgtaggc ctgctcactg ctactgtggc 1080 

gtctcgcatt caccgggccg cccagcccgc ccttctctat ttggtgccat ttactttatt 1140 

gccactcctc acgatggcct atttaaaggg cgacctccgg cggatgtggt ctgagccttt 1200 

ccactccaag tccagcagct cccgattcct ggaagtatga tggatcacgt ggaaagtgac 1260 

cagatggccg tcatagtcct tttctctcaa ctcatggttt gtttcctctt agagctggcc 1320 

■tggtactcag aaatgtacct gtgtttaagg aactgccgtg tgactggatt tggcatttaa 1380 

agggagctcg tttgcaggag agaggtgctg gagccctgtt tggttccttc tcttcctgcg 1440 

gatgtagagg tggggcccct tccaagaggg acaggcctct ccccagcgcg ccttcctccc 1500 

acgtttttat ggatctgcac cagactgtta ccttctgggg gagatggaga tttgactgtt 1560 

taaaaactga aaacagcgag gagtctttct agaacttttg aacactaaaa ggatgaaaaa 1620 

aattagcaaa ccgaagtttc ttcaatgacc cctcgagaac tttgggacca gtttcctatg 1680 

ggggactcag tttcagagaa ctgagacaga agctcttctg tcgttatatt cttctttcct 1740 

ttttttggat ttattaaata ttttctgtgg tgtgaagtga cttattaaat ccacagacat 1800 

tgagtgactt cttacaacat ccacataaga atttgttgta atgagttcat gtccacccag 1860 

atgttgtgtt ggcagtgaac aagggcacgg tttttataca tacgtacata tatatatata 1920 

aacacacaca ""tagatatata tgaataaaca aaaatgat 1958 



<210> 32 

<211> 1424 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> inisc_f eature 

<223> Incyte ID No: 1976095CB1 



<400> 32 

gtcaagtagg 

gaggctgagc 

atggacatcc 

ctcatggctc 

gccgtgctga 

cagataaagg 

accggagcca 

ccccactttg 

cggtttgtgg 

gtggtctgca 

cggagagtac 

ggaagctggg 

ggctgctgcc 

caaatggaac 

aaggctgtca 

caattagaac 

gagagaagac 

tctctaactt 

aggaaacact 

tttgcctccc 

cctgaggcta 

ctgccaagcc 

agggagaatc 

ctcaaatatt 



agacaaggag 
agggaaaaag 
tggtcccact 
tgctgggctg 
ctcccaagag 
ggcttacagg 
actttcagtt 
agaagttcct 
tggctcctgg 
ctctggtgct 
tgagaccggg 
ccttcatgtg 
tcaccagaga 
gacagccccc 
aataatcttt 
aagccaccca 
attcatgtac 
caatcccgcc 
aggaccctgt 
aatgttgtcc 
cacccatgcg 
cccctgaccc 
agagatgctg 
ttttaataaa 



caaagtccta 
ccagtgcccc 
cctgcagctg 
ctggcagccc 
caaccgcaag 
agcctccggg 
ctacccaccg 
gacaaagagc 
agaggacatg 
gtgctctgtg 
aggtgtgctc 
gcagcaagtt 
gacctggaag 
tcccttgaag 
cccaagctcc 
ccagcctatc 
cacctcctag 
ttcgacagtg 
tgtatcctca 
ctttccttcg 
tctctaggaa 
tctctcccca 
gggatgccag 
tagacgaaac 



tcacagcggg 
agcggaagca 
ctggtgctgc 
ctgtgcaaaa 
atggagagca 
aaagtggccc 
ggctgcaggg 
atggctgaga 
agacagctgg 
cagagcccaa 
tttttctggg 
ttcgagccca 
gate tt gaga 
tggctacctg 
aaggcactca 
tatcttccac 
tccctctctc 
aaaaagctct 
actgcaagtt 
ttcccatggt 
ctggtcacaa 
ctaccacctt 
agcaagactc 
cacgaaaaaa 



aggggacgcc 
cagctcagag 
ttcttaccct 
gctacttccc 
agaaacggga 
tactggagct 
tcacctgcct 
acaggcacct 
ctgatggctc 
ggaaggtcct 
agcatgtggc 
cctggaaaca 
acgcccagtt 
ttgggcccca 
tttgctcctt 
tgagagggac 
cccaacctct 
acttctacgc 
tctggactag 
aaagctcctc 
aagtcatggt 
cttcctgagc 
aaagaggcag 
aaaa 



agcgcctgca 
ctggtctgcc 
gcccctgcac 
ctacctgatg 
gctcttcagc 
gggctgcgga 
agacccaaat 
ccaatatgag 
catggatgtg 
gcaggaggtc 
agaaccatat 
cattggggat 
ctccgaaatc 
catcatggga 
ccccagcctc 
ctagcagaat 
gccagggcaa 
tgacccaggg 
tctcccaacg 
tcgctttcct 
gcctgcatcc 
tgggggcacc 
aggttttgtt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1424 



<210> 33 

<211> 2238 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2169991CB1 



<400> 33 

cctgtctgtg cacacctcac ggcaagggcc 
ttgctgcttt taattcaact cagaggtatg 
gatggcagga catggatggc ccttgaggca 
caagaagggc agtggcctgc ctgacttggg 
agaactgaaa ctagccaggc caattctcca 
tagattaggt ctggagtcag aaggtagcag 
gaagtggtcc tttggtcagg gtgggaagcc 
agggaagagg cgctggtggt. tgtgctttgg 
ctatgacaag ggagacagcg cctggggcag 
caggaggtat gagtgggcag tgggcgagcc 
gcaggtttaa gcgtgggacc cacgtcagat 
ctcaggtcct ccctgctggc ttcccactcc 
tgggtgaggg gcaggacaga gccctttcct 
gaaggggagt ccttctagcc cctgacagct 
tagccagggt tgagttctca cccacctgtg 
cttggggact gagcaggccc tcactgtcac 
accacctctg gggaaggtgt gagaggagag 
tgaagacttc ctgcgatgag aacagaggca 
ggactggagg gggccatggg gcaccggacc 
tgtgtcactg cggggacccc ggaggtgtgg 
tcgtccttca ccatccgttg tgggttcctg 
agctgggggg gccccaacgg tgctgggggg 
ggcatccggc aatgggcccc tgctcgccag 
ctcatcctgg aaggctctgg ggccagcagc 
tttgcgtcct tccctgaggg ctcctgggag 
ccagggctct ctgccccgcc gactcctgcc 



agcctgtttc ctcccggtca cctccaaatc 60 
cacttgaggt aggagggcag gggaagtggg 120 
ttggctctgg gtgtcatggg ctgtgagagt 180 
ttcgaaaggg tcactctggc cactgcggtg 240 
ttgttcctgc tcttccaggt aggagaatat 300 
gggctggggg ttgcaggggg atgttgagaa 360 
aacaggattt cctggtgcat tggaggtgaa 420 
gcctgagcag ccagaagccg ttgccatcac 480 
agcccagtat ggggtgcatt cagggtagat 540 
aggctggggg ttgtggggga ggcctgggtt 600 
tggtggtggg tcatgcatgc tggggtggct 660 
caggggcttt ctcctcccag attccttagc 720 
agggaagccc ggcaccccct gctgtccagg 780 
tctctgcccc tcccctggcc tccccaggcc 840 
ccgccctgcc ttgttacctg gaagcacagc 900 
tttaagaagg gaatcagcca ctttgtgctc 960 
aaggaagtgg ctgtttggct gctgacaaca 1020 
caggtgccgg ccctgcagcc cccagaacct 1080 
ctggtcctgc cctgggtgct gctgaccttg 1140 
gttcaagttc ggatggaggc caccgagctc 1200 
gggtctggct ccatctccct ggtgactgtg 1260 
accacgctgg ctgtgttgca cccagaacgt 1320 
gcccgctggg aaacccagag cagcatctct 1380 
ccctgcgcca acaccacctt ctgctgcaag 1440 
gcctgtggga gcctcccgcc cagctcagac 1500 
cccattctgc gggcagacct ggccgggatc 1560 



22/28 



wo 00/52151 



P.CT/USOO/05621 



ttgggggtct caggagtcct cctctttggc 
cataagcacc gccctgcccc taggctccag 
gcacgagcat gggcaccaag ccaggcctcc 
atcaacacca gctgccgccc agctactttg 
tggtgggcgt cactccccac ccacgctgca 
tccacaccca tccctgcacg tggcagcttt 
gcaggggaga ggcctcctca cactggtccc 
cccagggcca tggaaggacc cttaggagtt 
ttccccctcc caggcctcct gggtgtcacc 
gtgtcccata ggtgtctggc caggcccacc 
tgtgggcaca ggtgtgagtg tgtgagtgac 
aactaagtca gcaacgcc 



tgtgtctacc tccttcatct gctgcgccga 1620 
ccgtcccgca ccagccccca ggcaccgaga 1680 
caggctgctc ttcacgtccc ttatgccact 1740 
gacacagctc acccccatgg ggggccgtcc 1800 
caccggcccc agggccctgc cgcctgggcc 1860 
gtctctgttg agaatggact ctacgctcag 1920 
ggcctcactc ttttccctga ccctcggggg 1980 
cgatgagaga gaccatgagg ccactgggct 2040 
cccttacttt aattcttggg cctccaataa 2100 
tgctgcggat gtggtctctg tgtgtgcgtg 2160 
agttacccca tttcagtcat ttcctgctgc 2220 

2238 



<210> 34 

<211> 536 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> raisc_feature 

<223> Incyte ID No: 2616827CB1 

<400> 34 

gcatgaactt gggggtcagc atgctgagga tcctcttcct cctggatgta ggaggagctc 60 
aagtgctggc aacaggcaag acccctgggg ctgaaattga tttcaagtac gccctcatcg 120 
ggactgctgt gggtgtcgcc atatctgctg gcttcctggc cctgaagatc tgcatgatca 180 
ggaggcactt atttgacgac gactcttccg acctgaaaag cacgcctggg ggcctcagtg 240 
acaccatccc gctaaagaag agagccccaa ggcgaaacca caatttctcc aaaagagatg 300 
cacaggtgat tgagctgtag gtgagcagtg acgtgaagag gggttctagc cccgtggaaa 360 
acagcccatg gttaacatct caggatgttc tgcattcaaa cacccaaggc tggtaatgaa 420 
ctttcacatg gactgaatat tggaggcaaa taatagaagg aatagaatat acagtgcctc 480 
tgtcctgaag gaaaatatca tgcctcttct ggaagaaacg gactgcacag aggaag 536 



<210> 35 
<211> 2177 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



2991370CB1 



<400> 35 

cgggaggctc 

gcagaggcgg 

ggatggctcc 

ggtttctgcc 

actcagttct 

ttactgatgt 

acctggttct 

gcagctcacc 

gcgacatcga 

cgggggtggc 

tcctgagcga 

tggcctgtgt 

acggtaatgt 

ggggcattct 

gccgaggcgt 

atgagaatgg 

cggccagtgc 

tcaaccgtga 

atctgcaaat 

ccatgccctc 

agatcttctt 

tccgtagaga 



gaggccagcc 
cagcgagcgc 
gagcgctgac 
catcactgag 
gcctcctgac 
ggaccatgat 
gaagtatgac 
ctactacgcg 
cggggacggc 
cacgtacacc 
tgaggtcaac 
ggacagaaag 
gggccctgat 
ggcgctcaga 
cagcgtgggc 
gcctaacttc 
tggtgtggac 
tggcaaagtg 
gagcacccat 
ccctgtccgc 
caacaacatt 
gcacggagac 



cgggaccggg 
ccgcttccca 
cccggcatgt 
gggtcccagc 
tatgacagta 
ggggactttg 
cgggcccaga 
ctgcgggacc 
cgggaggaga 
gacaagttgt 
gtggcccgtg 
ggctctggac 
gccctcattg 
gatgtggctg 
cccatcctca 
cttttccaca 
gacccccacc 
gacatcgtct 
gggaaggtcc 
acggtcatca 
gcctaccgca 
cccctcatcg 



gctgggagca 
cgcccctagg 
ccaggatgtt 
gggctgaacc 
atcccaccca 
agatcgtcgt 
agcggctggt 
ggcaggggaa 
tctacttcct 
tcaagttccg 
gtgtggccag 
gctactctat 
aaatggaccc 
ctgaggctgg 
gcagcagtgc 
accggggcga 
agcatgggcg 
atggcaactg 
gcttccggga 
ccgccgactt 
gctcctcagc 
aggagctcaa 



agcaggcggc 
cggcggggcc 
accgttcctg 
catgttcact 
gctcaactat 
ggcggggtac 
gaacatcgcg 
cgccattggg 
caacaccaat 
caataaccgg 
cctctttgcc 
ctacattgcc 
tgaggccagt 
ggtcagcaaa 
ctcggatatc 
tggcaccttt 
aggtgtcgcc 
gaatggcccc 
catcgcctca 
tgacaatgac 
caaccgcctc 
tcccggcgac 



ggcgccggcg 
gagagcggga 
ctgctgctct 
gcagtcacca 
ggtgtggcag 
aatggaccca 
gtcgatgagc 
gtcacagcct 
aatgccttct 
tgggaagaca 
ggacgctctg 
aattacgcct 
gacctctccc 
tatacagggg 
ttctgcgaca 
gtggacgctg 
ctggctgact 
caccgcctct 
cccaagttct 
caggagctgg 
ttccgcgtca 
gccttggagc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



23/28 



wo 00/52151 



PCTAJS00/05d21 



ctgagggccg gggcacaggg ggtgtggtga 
tcatcttgtc ccatggagag tccatggctc 
gcttcaacaa caactggctg cgagtggtgc 
gagctaaggt cgtgctctac accaagaaga 
gctcaggcta cctgtgtgag atggagcccg 
ccagcagtgt ggaggtgacg tggccagatg 
gggagatgaa ctcagtgctg gagatcctct 
cagccccact ggagtgtggc caaggattct 
ccaatgaatg catccagttc ccattcgtgt 
cctatggaag ctacaggtgc cggaccaaca 
aggatggcac agcctgcgtg ggctggtgga 
ttgggaagag ccttggtccc tgaatcactg 
cctgttgatc aggaacactt acctggaact 
ttaagctatt aatacattaa gatttggggg 
tcttgaaaaa aaaaaaa 



ccgacttcga cggagacggg atgctggacc 1380 
agccgctgtc cgtcttccgg ggcaatcagg 1440 
cacgcacccg gtttggggcc tttgccaggg 1500 
gtggggccca cctgaggatc atcgacgggg 1560 
tggcacactt tggcctgggg aaggatgaag 1620 
gcaagatggt gagccggaac gtggccagcg 1680 
acccccggga tgaggacaca cttcaggacc 1740 
cccagcagga aaatggccat tgcatggaca 1800 
gccctcgaga caagcccgta tgtgtcaaca 1860 
agaagtgcag tcggggctac gagcccaacg 1920 
gccctgtgtt gaagatagtg acaccacaag 1980 
aatcactgcc ttgaatcacc gcctggaata 2040 
tcactgagca ggatacaaac ttctattgta 2100 
tgctacctta cataataaat tcccatttcc 2160 

2177 



<210> 36 
<211> 2043 
<212> DNA 

<213> Homo* sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3031062CB1 

<400> 36 

cgccacgacg cagcggggaa tctgcagtag 
cgccgcttcg gctctggctg ctgttgttcc 
agtcaggttc aaaatggaaa gtatttattg 
aaccatgttc aagtcaaaac tgcagctgct 
ctttccgagg aggcatctcc aggaagatga 
cccactatca gatcactaag aacagactgt 
ggtgtagtgg tgttgagcac tttattttgg 
tggtgatcaa tgtacgagat tatcctcagg 
tcttctcctt cagtaagaca tcagagtacc 
gggaaggggg acctgctgtt tggccaattt 
tcagagaaga tctggtaagg tcagcagcac 
catatttccg aggatcaagg acaagtccag 
aaaacccaaa acttgttgat gcagaataca 
ataccttagg aaagccagct gctaaggatg 
atctgtttaa ttttcgaggc gtactgcaag 
ccattattct aatgagaaag agaacatact 
atgtatttac agatgtttat tgagtacctg 
actagaaaat acgatatttg tcactctgct 
tggaacatag tagatgaaaa aaaatacatg 
caagttatta tacttatgac tcacaaattt 
tatccagaaa atagttgagt agagatgaat 
tttgcatttg ggataataca ctcaacacat 
atcctgcctt cttcccaaaa ggatttaaag 
taaaataaag aaacagacta tacaacatgc 
ccatcatatg gaacccttat cctcccacca 
ctgtgatctc tcttctttca aaaacgtaga 
cacatatatt ctggatggca tcctccttca 
tcctcatcat ccttgtatcc aggaccacag 
cccatctccc caacctccag caaaagaaaa 
gattgaagat gaagatgagg atgagttcaa 
ggatggagtc tagagcctcc cagagcctgg 
cgtgggccac ggtgacccac catgaagtcc 
gagttgctgc acatcacacc agcccctgcc 
cacgaggagc tctgctgaga ctctcaaggg 
tgc 



gtctgccggc gatggagtgg tgggctagct 60 
tcctgccctc agcgcagggc cgccagaagg 120 
accaaattaa caggtctttg gagaattacg 180 
accatggtgt catagaagag gatctaactc 240 
tggcagaggt agtcagacgg aagctaggga 300 
accgggaaaa tgactgcatg ttcccctcaa 360 
aagtgatcgg gcgtctccct gacatggaga 420 
ttcctaaatg gatggagcct gccatcccag 480 
atgatatcat gtatcctgct tggacatttt 540 
atcctacagg tcttggacgg tgggacctct 600 
agtggccatg gaaaaagaaa aactctacag 660 
aacgagatcc tctcattctt ctgtctcgga 720 
ccaaaaacca ggcctggaaa tctatgaaag 780 
tccatcttgt ggatcactgc aaatacaagt 840 
tttccggttt aaacacctct tcctgtgtgg 900 
agtatggaaa tttcttaagg gcaggaagtc 960 
ttatatatca ggactaagct gctgggatgt 1020 
tccatggaaa ttttagacta gcataatacc 1080 
tggttttagt ggttgaaatt taagcatttg 1140 
acctttcacc gaattaagcc aaaaagactt 1200 
acataggatc ctgcatgatt taactttctc 1260 
ttctctacca catttaaaat ttttatttat 1320 
cagcttacaa aaatgtatag cagggcggga 1380 
atactgtgaa gtcctcttca cttccaggta 1440 
caaaatgtat aattctgctt atatctgtac 1500 
cctttcttct tatgaaaaaa acaatccccc 1560 
ctttgtctgg gactttggct ctgcccactg 1620 
cctttgtcat gcagacatat tcagcttcct 1680 
cccttcattt tcacattccc cttcagccga 1740 
ggatgaagac caggatgagg acaaggatga 1800 
agaggaggcc tcggtcagcc actccgtgga 1860 
ccactagcca ctcgattccc tgctctgtca 1920 
aagagcagga gtcaccacag gctgaatgcc 1980 
agccagtgaa agaaatagaa ataaagcctg 2040 

2043 



<210> 37 
<211> 1743 
<212> DNA 



24/28 



wo 00/52151 

<213> Homo sapiens 



PCT/US00y05621 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3101617CB1 

<400> 37 

cagcaggtca cagcccctcg aggcgacagc 
catggatggg aagaaatgca gcgtatggat 
ttcagctgga ttgtggatag tatacttcat 
aaattcagct gaaaggaaac ctggtgtgaa 
tgatcctcct gcaagctgtg tgtttagtca 
tgtggtagct gttctgcgct tcatacaact 
tattagtgga ttggtggctc tgtgtctggc 
tcagctcaca aatgatgaag aaatccataa 
cacattgacc tgctggatcc aggctgcgct 
acggagagtt ggaattccac gggttattct 
ctacttcatc ctcatggccc aaagcatcca 
ggtcatgtgc ttcctgtctt attttggcac 
tgagattgtt tgctctgagt accaggagaa 
agcttctgaa tatcagactg accaggtgta 
ggtgtgacag tgggggaggg gccagtagga 
tttcacacac acacacacac acacattcat 
cgagttattt ctttaatgaa aaagcacaag 
ctgaaaatat atgcacgaca gagcaagaag 
ttcccagcac tccctcctct tcccattctc 
ggcaggccaa atgttccttg ggagtaatgc 
ggcttggaac cagctcgtga ggaagttctg 
tagtgtatca tagaatagga cggaaattgt 
aaggcatagt gagaagaact ttcccacgaa 
tgtgtggatc ccaggagaga catatgccac 
tctggacttg atgcactgtg actgagaatg 
ggtctatcag gcctggaaca agatgggggc 
tagtatgcca tgtacaatgt tttatatttc 
tctctaagcc tcatggacaa agatgtagac 
caaccatgat caaagaaaaa ctgaggtcac 
ate 



ggccccgccg caccagagca gtggtacagg 60 
gttcctacct cttgtattta ctttgtttac 120 
agctgtggaa gatgacaaaa ttttaccatt 180 
gcatgcacca tatataagca ttgcaggtga 240 
agttatgaac atggcagcct tcctagccct 300 
gaaaccgaag gttttaaacc cgtggctgaa 360 
ttccttcgga atgaccttac ttggtaattt 420 
cgtcggaact tccttgacct ttggatttgg 480 
gacactcaag gtcaacatca agaatgaagg 540 ■ 
gtcggcatct atcactctct gtgtggtcct 600 
catgtatgca gccagggtcc agtggggcct 660 
ctttgccgtg gagttccggc attaccgcta 720 
tttcctaagc ttctcagaaa gcctgtcaga 780 
aaccatcagt ttttccttgc tggtgaggtg 840 
cacactcaca ggacttgaca tagaacctca 900 
ggccacattt gccaaatgag cttttcaggg 960 
cccttatgtg tcgaaataca cgctgttaca 1020 
cttgtgcatg atcacttctt atccgtcccc 1080 
tccacatgtc tcaagcaccc taccgagtag 1140 
caactcccga cgttgccttc aggtccaaag 1200 
aatctggcac taatattctt gagtggataa 1260 
attgagatgt gaccctgtgt cgcctgtgga 1320 
agcccccttc atcgttgttc agtggtcggc 1380 
agactgtgag agcaaagccc gccgctgtga 1440 
atttccaaat gtgaatatgt gtagggacgt 1500 
agtgaaggta tggtttagtg tttgctttca 1560 
atagtttctt ttaagtaact accatgagtc 1620 
caaatgcaag agctgagctt gctttgggtt 1680 
ctgcaggctt acgtgggaag ctaagacaat 1740 

1743 



<210> 38 
<211> 1306 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3216178CB1 

<400> 38 

ctgcaaagtt cctgtgagcg ctgtcatttt 
gaggctggag tttccaggat gtcaaaatta 
tgggatacct gtgtcactcc tgctgtctgc 
ccccaggagt agggaggaac caggtgggct 
gccttggacg ctgcagcact tctatctctg 
cctcctcctg ctgtggctga gccttggggt 
cctttgctgt cttgggacgg atcaccactg 
ctgccatgtg gcaccagact gccacccaga 
gatgaccaag atggtgctgc agatggtgct 
gagccaccta gactggatgc agagcatggt 
ggatggcagc ctgctccttg cctttgtgcc 
aagaaagggg ccgtagctag ggcagagctc 
caggattgcc gtctgtggac actgaaattt 
cttcctcttt tgttcttctc ctaccatcta 
ttgcacaaaa acaggcagtg gccagatttg 
cctaaatcat cctccatttc tttccttctg 
taaaatgggg tctttccctg tttggtgcca 
ctcaagcagt gcaggcttta tttggtggcc 



gtcactctgg tttttcagat tcttcccctg 60 
cctctgcttg ggtgagctat ttcaagcagc 120 
cagtgactgc ccaggtgtct gctggttcct 180 
ggctgggatg ggtggatatt taaagaccag 240 
cttgatgcct gctgccacgt ggctggtcct 300 
gaagacaggc agctgctccc aaccccagaa 360 
caagagggga agttgctact gtgatgaatt 420 
ccacagtgtc ctctgcaacc ctgcttctca 480 
gaggatggag aacccaccaa gccccgctag 540 
gagctccctg caggttctct gagaaggggt 600 
ctccaggccc caaagtcagg gaaccaaaag 660 
cactgcaatg attgttttag gggtaggagc 720 
gaatctcata tacttttgtg acaaaacatt 780 
aaaatgtaga aaacattctt agcctatgag 840 
gcccatagac catagtttgc tgacttctgc 900 
tgtccttgtt actgacaaag ccactttccc 960 
tgaagccaat atgcaaaacc gaaagtgagc 1020 
atggaattga gaagtgagag cttggctcac 1080 



25/28 



wo 00/52151 



PCTAJS00y05621 



aaatcaactt ttctgctcgt gagacccagg aagtcacaga tacagggcat ctttagtgaa 1140 
ggggctgagc attaaaagca aggggaggag tggccaggtg caatggctca ctcccataaa 1200 
cccagaactt tgggaggcca aaatgagagg attgctgaga ccaggagttc gagaccatcc 1260 
tggtcaacat agtgatacac ccccatctct acaaaaataa aaatga 1306 



<210> 39 

<211> 851 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 3406803CB1 

<400> 39 

gggctggcca cactgcaggg gctgcaggaa 
cagcataggg cacaggccaa agaaaacttt 
gacttgttct ctttggaata ggtcttcctc 
gctgtaacta tctgtgggct gttgggcaag 
gggtcctggc ccctcatggc tctgtgatca 
tgttggtgct cagcccagga gccctccttg 
gcctgccgca. accagtctag tgccttttct 
ctgcccttac agagcctgca gtccagggtg 
ggagaccttg agtggagggg aggtcaggaa 
tgaaagtacg atgaccagag ccttgtgtgt 
ggtgctgtgt gcagggcgtg cctgagaaga 
aaggcttgga ggccaatgag ggggctgatg 
aggccctcct gagtgggagc aaggctatgt 
gtggctgtac gaacagccat tgaggctgag 
gtgtcgagac t 



gcaaaggatg aaactgatct tttcactaac 60 
ggttactctc ttgtgagcca gttgaagtta 120 
tgcagaaata aaaacacttg tctgaaagag 180 
gacacttcag atactggctt tgagctcact 240 
tctctgctcc acattgcagg ccatgctccc 300 
ggtcctggcc agactcctcc accctcgtgg 3 60 
cccatggggg tccctggaat ctcacacacc 420 
ggagttaacc ctttccactt tcccagagag 480 
acgcggggct ggcagcatgg tgggaggagg 540 
gcggctgttg acaaagctga gggtatgatg 600 
ccactcctga tggagtgctg ggggaagttc 660 
gcacccatcc agtggagagg ctgtatttgg 720 
ggatgatgag atgaggtaag tgagatggaa 780 
gcaggaggat tgcctaagtc catagcccag 840 

851 



<210> 40 

<211> 2204 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 3468066CB1 

<400> 40 

gagcagtcct tgctggtccc gcccccgctc 
agtctccttg gcaaccactt gctcctcccc 
gtgcagtata tctcgcgctc tctccccttt 
aggttggtct ggaccggaag cgaagatggc 
gatcggctgg tgcatattcg gcctcttact 
tgttcgtaaa taccaaagtc ggcgggaaag 
ttctctagca attgcactta tcacatcagc 
ttacatgaaa aatcaaaatg gtacatttaa 
gattgaggac actgtattat acggttacta 
gttcttctgg atcccttttg tctacttcta 
taaatgtact caaattaaaa cggcactcaa 
actgcttctt ttagttggtg cctttgttcc 
agagtgggaa aaagtgaagt ccctatttga 
attgtcattt tctatcagtt ctctgacctt 
agcctatggc atgtctgcgt tacctttaaa 
tgaacgtttg gaaaacactg aagacattga 
atcaaaaagc aaagatggtc gacctttgcc 
tgaagaaagg ttacgaacac ttaagaagag 
ctggtggaca aaattttgtg gcgctctgcg 
catcttagtt gcattgctgt ttgtaatttc 
tcattcagct ggaatagatt ctggtttcat 
gaatatgctt ttgcctttac tacaaacagt 
tattattatg tactttattt ttacttcaat 
cttttggatt agattatata aaatcagaag 



ggctcgccgc caggggacgc tagtgggtcc 60 
ctccgcccct ttaaccttta gggtgcgcgg 120 
cccccccccc ttttcccacc ccgggcgctc 180 
gacttctggc gcggcctcgg cggagctggt 240 
actggctatt ttggcattct gctggatata 300 
tgaagttgtc tccaccataa cagcaatttt 360 
acttctacca gtggatatat ttttggtttc 420 
ggactgggct aatgctaatg tcagcagaca 480 
tactttatat tctgttatat tgttctgtgt 540 
ttatgaagaa aaggatgatg atgatactag 600 
gtatactttg ggatttgttg tgatttgtgc 660 
attgaatgtt cccaataaca aaaattctac 720 
agaacttgga agtagtcatg gtttagctgc 780 
gattggaatg ttggcagcta taacttacac 840 
tctgataaaa ggcactagaa gcgctgctta 900 
agaagtagaa caacacattc aaacgattaa 960 
agcaagggat aaacgcgcct taaaacaatt 1020 
agagaggcat ttagaattca ttgaaaacag 1080 
tcccctgaag atcgtctggg gaatattttt 1140 
tctcttcttg tcaaatttag ataaagctct 1200 
aatttttgga gctaacctga gtaatccact 1260 
tttccctctt gattatattc ttataacaat 1320 
ggcaggaatt cgaaatattg gcatatggtt 1380 
aggtagaacc aggccccaag cactcctttt 1440 
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tctctgcatg atacttctgc ttattgtcct tcacactagc tacatgattt atagtcttgc 1500 
tccccaatat gttatgtatg gaagccaaaa ttacttaata gagactaata taacttctga 1560 
taatcataaa ggcaattcaa ccctttctgt gccaaagaga tgtgatgcag aagctcctga 1620 
agatcagtgt actgttaccc ggacatacct attccttcac aagttctggt tcttcagtgc 1680 
tgcttactat tttggtaact gggcctttct tggggtattt ttgattggat taattgtatc 1740 
ctgttgtaaa gggaagaaat cggttattga aggagtagat gaagattcag acataagtga 1800 
tgatgagccc tctgtctatt ctgcttgaca gccttctgtc ttaaaggttt tataatgctg 1860 
actgaatatc tgttatgcat ttttaaagta ttaaactaac attaggattt gctaactagc 1920 
tttcatcaaa aatgggagca tggctataag acaactatat tttattatat gttttctgaa 1980 
gtaacattgt atcatagatt aacattttaa attaccataa tcatgctatg taaatataag 2040 
actactggct ttgtgaggga atgtttgtgc aaaatttttt cctctaatgt ataatagtgt 2100 
taaattgatt aaaaatcttc cagaattaat attccctttt gtcacttttt gaaaacataa 2160 
taaatcatct gtatctgtgc cttaggttct ccaaaaaaaa aaaa 2204 



<210> 41 
<211> 570 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 3592862CB1 

<400> 41 

gcgcggaggc tcggggagtc ggcgccatga ccccatcgag gcttccctgg ttgcttagct 60 
gggtctcggc cacggcgtgg agagcggcaa gatcacccct tctgtgtcat tctctgagga 120 
aaacaagttc ttctcaagga ggaaagtctg aacttgtcaa acagtccctt aagaagccga 180 
agttaccaga aggtcgtttt gatgcaccag aggattccca tttagagaaa gaaccactgg 240 
aaaaatttcc agatgatgtt aatccagtga ccaaagaaaa aggtggaccc aggggcccag 300 
aacctacccg atatggagat tgggaacgaa aaggacgctg tattgatttt taagtcgcat 360 
attctttaac ttcaatattg ttttctgaat atgtacatct gaattaactt atttctgatt 420 
attttctttc tttatatcct ttatgtcgtg tagtttgtgt aatgtgttta aatatatata 480 
tatatatata tatatatata tatatatatg ggggcttagg aagaaaatat gctgctgtaa 540 
attaggaaag ggagaccagc ctgaccaata 570 



<210> 42 
<211> 802 
<212> DNA 

<213> Homo sapiens 

<220> ' 

<221> misc_feature 

<223> ihcyte ID No: 3669422CB1 

<400> 42 

cagggtcaag gtgaagctgg tggtgtctcg aggcgggtga gtgtcatggg ggagcctggg 60 
tgggggtcac actggctctc tctagtccca tgtcgtcgtc ctcttcacga tgcctctccc 120 
cttccccagg gatgtctctg tggagctgcc ttttgttctt atgcacccca agccccacga 180 
ccacatcccc ctccccagac cccagtcagg tgagcacact acccacccca agccctcaga 240 
gggagggcct gaagcagggc cagtggagga aaactggccc ttccagcacc cacccccaca 300 
ccccctcttc ccgtcccccc agcccctctt cccttcccct cacctggaag cttcttcaac 360 
caatcccttc acactctctc ccccatcccc ccaagataca cactggaccc tctcttgctg 420 
aatgtgggca ttaatttttt gactgcagct ctgcttctcc agccccgccg tgggtggcaa 480 
gctgtgttca tacctaaatt ttctggaagg ggacagtgaa aagaggagtg acaggaggga 540 
aagggggaga caaaactcct actctcaacc tcacaccaac acctcccatt atcactctct 600 
ctgcccccat tccttcaaga ggagaccctt tggggacaag gccgtttctt tgtgaggaat 660 
aaaaaggtta gaagggcccc cctctctgaa ggcccccact ccctgggatg ctacaatcca 720 
atgatggaag atggcattag ctacaccacc ctgcgctttc ccgagatgaa cataccacga 780 
actggagatg cagagtcctc ag » 802 



<210> 43 
<211> 693 
<212> DNA 
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<213> Homo sapiens 



<220> 

<221> misc^feature 

<223> Incyte ID No: 3688740CB1 



<400> 43 

gttggtttaa tgggattgtg gaagagaatg 
atcagccacc gtccaagaac tgcacacatg 
gcgagcacaa ctcgacctcc tatgactctg 
tgatgctcct gggggtagtt gctgtagtca 
ccttcgccag ccattttctc tacaaagctg 
tttcttctct ctgttactca agcctctcaa 
cgtcttcagc catcaatgac atctcactcc 
ccagtcactg gacctgcatt acagtgggct 
taactcgttt tcccatttag ttgcaagaca 
tgttttaatg tttttcttgt aaatgcttta 
ccaaatcctt attgtttaaa agtttctctc 
gaatgaatgg atgaaatgca tacctgctta 



actccaatat ttggaagttc tggtacacca 60 
cttacctgtc tccgtacccc ttcatgagag 120 
cagttattta ccgtggtttc tgggcagtcc 180 
tcgcaagctt tttgatcatc tgtgcagccc 240 
ggggaggctc atatattgct gcagatggaa 300 
agtccttatt gtcccagcct ctgcgtgaaa 360 
ttcaagccct tatgccactg ctgggatgga 420 
tatattgacc tttacctcta cacttgtata 480 
cttggaagca cagaccaagg cttacatttg 540 
tgcctaaatg tttctgtact actcttcttt 600 
ctactatacc atgccttata aatattgatt 660 
taig 693 



<210> 44 
<211> 1212 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3742589CB1 

<400> 44 

ccctcgaggc aacttgccct tctcaaacat 
ctctgggccc cgcctttgat ctcgttggtg 
gacaagtcgc cggcggcgcc cgacggagca 
tcagtgcagc cctccttgcc tttgtccaga 
tggatgaggt catcttctcc tatgtgcttg 
catcagagga gaacttcgat atggaggctt 
gcttcgccca catccccagg ggcacaatag 
tgagcgatgc caggaacaaa gagaacctgc 
tgcccatctc cccagagccc ctgcagcggc 
cggctgctgc tgctgcagac acccaagatg 
caggggtgga tgtactcctg gaggtgttcc 
tgctggccaa agctcggggg gacttggaag 
aagaggggcc tgcagcctgg gagggcccca 
cccaaaagga tgagctgaag tccttcatcc 
aggatcagaa gattcaccgg cccatggctc 
acatcgacaa ccaggtagtg agcaccaaag 
aggccgagga gatgaaggcc acatacatca 
attgaggcac tcgccggact ctgcccgagc 
gagccctata cccctacaca ggggccccct 
tccatagtgt taacctactc tcggagctgc 
aaaaaaaaaa aa 



ggccgccacg gcgcctctgg aagggaaccg 60 
gggctggggg atgagagctg caccgcgcgg 120 
gaacagagag catggagctg gagaggatcg 180 
cacacctccc ggaggccgac ctcagtggct 240 
gggtcctgga ggacctgggc ccctcgggcc 300 
tcactgagat gatggaggcc tatgtgcctg 360 
gggacatgat gcagaagctc tcagggcagc 420 
aaccgcagag ctctggtgtc caaggtcagg 480 
ccgaaatgct caaagaagag actaggtctt 540 
aggcaactgg cgctgaggag gagcttctgc 600 
ctacctgttc ggtggagcag gcccagtggg 660 
aagctgtgca gatgctggta gagggaaagg 720 
accaggacct gcccagacgc ctcagaggcc 780 
tgcagaagta catgatggtg gatagcgcag 840 
ccaaggaggc ccccaagaag ctgatccgat 900 
gggagcgatt caaagatgtg cggaaccctg 960 
acctcaagcc agccagaaag taccgcttcc 1020 
cttctaggct cagatcccag agggatgcag 1080 
aactcctgtc ccccttctct actcctttgc 1140 
ctccatgggc acagtaaagg tggcccaagg 12 00 

1212 
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